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Introduction 


The  emphasis  in  this  course  is  on  problems — doing  calculations  and  story  problems.  To 
master  problem  solving  one  needs  a  tremendous  amount  of  practice  doing  problems.  The 
more  problems  you  do  the  better  you  will  be  at  doing  them,  as  patterns  will  start  to  emerge 
in  both  the  problems  and  in  successful  approaches  to  them.  You  will  learn  fastest  and  best 
if  you  devote  some  time  to  doing  problems  every  day. 

Typically  the  most  difficult  problems  are  story  problems,  since  they  require  some  effort 
before  you  can  begin  calculating.  Here  are  some  pointers  for  doing  story  problems: 

1.  Carefully  read  each  problem  twice  before  writing  anything. 

2.  Assign  letters  to  quantities  that  are  described  only  in  words;  draw  a  diagram  if 
appropriate. 

3.  Decide  which  letters  are  constants  and  which  are  variables.  A  letter  stands  for  a 
constant  if  its  value  remains  the  same  throughout  the  problem. 

4.  Using  mathematical  notation,  write  down  what  you  know  and  then  write  down 
what  you  want  to  find. 

5.  Decide  what  category  of  problem  it  is  (this  might  be  obvious  if  the  problem  comes 
at  the  end  of  a  particular  chapter,  but  will  not  necessarily  be  so  obvious  if  it  comes 
on  an  exam  covering  several  chapters). 

6.  Double  check  each  step  as  you  go  along;  don't  wait  until  the  end  to  check  your 
work. 

7.  Use  common  sense;  if  an  answer  is  out  of  the  range  of  practical  possibilities,  then 
check  your  work  to  see  where  you  went  wrong. 
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Suggestions  for  Using  This  Text 

1.  Read  the  example  problems  carefully,  filling  in  any  steps  that  are  left  out  (ask 
someone  for  help  if  you  can't  follow  the  solution  to  a  worked  example). 

2.  Later  use  the  worked  examples  to  study  by  covering  the  solutions,  and  seeing  if 
you  can  solve  the  problems  on  your  own. 

3.  Most  exercises  have  answers  in  Appendix  A;  the  availability  of  an  answer  is  marked 
by  at  the  end  of  the  exercise.  In  the  pdf  version  of  the  full  text,  clicking 
on  the  arrow  will  take  you  to  the  answer.  The  answers  should  be  used  only  as 
a  final  check  on  your  work,  not  as  a  crutch.  Keep  in  mind  that  sometimes  an 
answer  could  be  expressed  in  various  ways  that  are  algebraically  equivalent,  so 
don't  assume  that  your  answer  is  wrong  just  because  it  doesn't  have  exactly  the 
same  form  as  the  answer  in  the  back. 

4.  A  few  figures  in  the  pdf  and  print  versions  of  the  book  are  marked  with  "(AP)"  at 
the  end  of  the  caption.  Clicking  on  this  should  open  a  related  interactive  applet 
or  Sage  worksheet  in  your  web  browser.  Occasionally  another  link  will  do  the 
same  thing,  like  this  example.  (Note  to  users  of  a  printed  text:  the  words  "this 
example"  in  the  pdf  file  are  blue,  and  are  a  link  to  a  Sage  worksheet.) 


1 

Analytic  Geometry 


Much  of  the  mathematics  in  this  chapter  will  be  review  for  you.  However,  the  examples 
will  be  oriented  toward  applications  and  so  will  take  some  thought. 

In  the  [x,  y)  coordinate  system  we  normally  write  the  x-axis  horizontally,  with  positive 
numbers  to  the  right  of  the  origin,  and  the  y-axis  vertically,  with  positive  numbers  above 
the  origin.  That  is,  unless  stated  otherwise,  we  take  "rightward"  to  be  the  positive  x- 
direction  and  "upward"  to  be  the  positive  y-direction.  In  a  purely  mathematical  situation, 
we  normally  choose  the  same  scale  for  the  x-  and  y-ax.es.  For  example,  the  line  joining  the 
origin  to  the  point  (a,  a)  makes  an  angle  of  45°  with  the  x-axis  (and  also  with  the  y-axis). 

In  applications,  often  letters  other  than  x  and  y  are  used,  and  often  different  scales  are 
chosen  in  the  horizontal  and  vertical  directions.  For  example,  suppose  you  drop  something 
from  a  window,  and  you  want  to  study  how  its  height  above  the  ground  changes  from 
second  to  second.  It  is  natural  to  let  the  letter  t  denote  the  time  (the  number  of  seconds 
since  the  object  was  released)  and  to  let  the  letter  h  denote  the  height.  For  each  t  (say, 
at  one-second  intervals)  you  have  a  corresponding  height  h.  This  information  can  be 
tabulated,  and  then  plotted  on  the  (t,  h)  coordinate  plane,  as  shown  in  figure  1.0.1. 

We  use  the  word  "quadrant"  for  each  of  the  four  regions  into  which  the  plane  is 
divided  by  the  axes:  the  first  quadrant  is  where  points  have  both  coordinates  positive, 
or  the  "northeast"  portion  of  the  plot,  and  the  second,  third,  and  fourth  quadrants  are 
counted  off  counterclockwise,  so  the  second  quadrant  is  the  northwest,  the  third  is  the 
southwest,  and  the  fourth  is  the  southeast. 

Suppose  we  have  two  points  A  and  B  in  the  {x,y)-p\ane.  We  often  want  to  know  the 
change  in  x-coordinate  (also  called  the  "horizontal  distance")  in  going  from  A  to  B.  This 


15 


16     Chapter  1  Analytic  Geometry 


seconds 

0 

1 

2 

3 

4 

meters 

80 

75.1 

60.4 

35.9 

1.6 

t 


0  12  3  4 


Figure  1.0.1     A  data  plot,  height  versus  time. 

is  often  written  Ax,  where  the  meaning  of  A  (a  capital  delta  in  the  Greek  alphabet)  is 
"change  in" .  (Thus,  Ax  can  be  read  as  "change  in  x"  although  it  usually  is  read  as  "delta 
x" .  The  point  is  that  Ax  denotes  a  single  number,  and  should  not  be  interpreted  as  "delta 
times  x" .)  For  example,  if  A  =  (2,1)  and  B  =  (3,3),  Ax  =  3  —  2  =  1.  Similarly,  the 
"change  in  y"  is  written  Ay.  In  our  example.  Ay  =  3  —  1  =  2,  the  difference  between  the 
y-coordinates  of  the  two  points.  It  is  the  vertical  distance  you  have  to  move  in  going  from 
A  to  B.  The  general  formulas  for  the  change  in  x  and  the  change  in  y  between  a  point 
{xi,yi)  and  a  point  {x2,y2)  are: 

Ax  =  X2-xi,  Ay  =  y2-yi. 

Note  that  either  or  both  of  these  might  be  negative. 

1.1  Lines 

If  we  have  two  points  A{xi,y\)  and  B{x2,y2),  then  we  can  draw  one  and  only  one  line 
through  both  points.  By  the  slope  of  this  line  we  mean  the  ratio  of  Ay  to  Ax.  The  slope 
is  often  denoted  m:  m  =  Ay/ Ax  =  {y2  —  yi)/{x2  —  xi).  For  example,  the  line  joining  the 
points  (1,  -2)  and  (3,  5)  has  slope  (5  +  2)/(3  -  1)  =  7/2. 

EXAMPLE  1.1.1  According  to  the  1990  U.S.  federal  income  tax  schedules,  a  head 
of  household  paid  15%  on  taxable  income  up  to  $26050.  If  taxable  income  was  between 
$26050  and  $134930,  then,  in  addition,  28%  was  to  be  paid  on  the  amount  between  $26050 
and  $67200,  and  33%  paid  on  the  amount  over  $67200  (if  any).  Interpret  the  tax  bracket 
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information  (15%,  28%,  or  33%)  using  mathematical  terminology,  and  graph  the  tax  on 
the  y-axis  against  the  taxable  income  on  the  x-axis. 

The  percentages,  when  converted  to  decimal  values  0.15,  0.28,  and  0.33,  are  the  slopes 
of  the  straight  lines  which  form  the  graph  of  the  tax  for  the  corresponding  tax  brackets. 
The  tax  graph  is  what's  called  a  polygonal  line,  i.e.,  it's  made  up  of  several  straight  line 
segments  of  different  slopes.  The  first  line  starts  at  the  point  (0,0)  and  heads  upward 
with  slope  0.15  (i.e.,  it  goes  upward  15  for  every  increase  of  100  in  the  x-direction),  until 
it  reaches  the  point  above  x  —  26050.  Then  the  graph  "bends  upward,"  i.e.,  the  slope 
changes  to  0.28.  As  the  horizontal  coordinate  goes  from  x  =  26050  to  a;  =  67200,  the  line 
goes  upward  28  for  each  100  in  the  x-direction.  At  a;  =  67200  the  line  turns  upward  again 
and  continues  with  slope  0.33.  See  figure  1.1.1.  □ 


The  most  familiar  form  of  the  equation  of  a  straight  line  is:  y  —  mx  +  b.  Here  m  is  the 
slope  of  the  line:  if  you  increase  x  by  1,  the  equation  tells  you  that  you  have  to  increase  y 
by  m.  If  you  increase  x  by  Ax,  then  y  increases  by  Ay  =  mAx.  The  number  b  is  called 
the  y-intercept,  because  it  is  where  the  line  crosses  the  y-axis.  If  you  know  two  points 
on  a  line,  the  formula  m  =  {y2  —  yi) / {x2  —  xi)  gives  you  the  slope.  Once  you  know  a  point 
and  the  slope,  then  the  y-intercept  can  be  found  by  substituting  the  coordinates  of  either 
point  in  the  equation:  yi  =  mxi  +  6,  i.e.,  b  =  yi  —  mxi.  Alternatively,  one  can  use  the 
"point-slope"  form  of  the  equation  of  a  straight  line:  start  with  {y  —  yi)/{x  —  xi)  =  m  and 
then  multiply  to  get  {y  —  yi)  =  m{x  —  xi),  the  point-slope  form.  Of  course,  this  may  be 
further  manipulated  to  get  y  =  mx  —  mxi  +  yi,  which  is  essentially  the  "mx  -f-  6"  form. 

It  is  possible  to  find  the  equation  of  a  line  between  two  points  directly  from  the  relation 
(y  —  yi)/{x  —  xi)  =  (2/2  —  Hi)/ {^2  —  xi),  which  says  "the  slope  measured  between  the  point 
(xi,  2/1)  and  the  point  (x2, 2/2)  is  the  same  as  the  slope  measured  between  the  point  (xi,  yi) 
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Figure  1.1.1     Tax  vs.  income. 
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and  any  other  point  (x,  y)  on  the  hne."  For  example,  if  we  want  to  find  the  equation  of 
the  hne  joining  our  earher  points  A(2, 1)  and  -B(3,  3),  we  can  use  this  formula: 


y-l  3-1 


so  that 


X 


y-l  =  2(x-2) 


I.e., 


y  =  2x  —  3. 


Of  course,  this  is  really  just  the  point-slope  formula,  except  that  we  are  not  computing  m 
in  a  separate  step. 

The  slope  m  of  a  line  in  the  form  y  =  mx  +  h  tells  us  the  direction  in  which  the  line  is 
pointing.  If  m  is  positive,  the  line  goes  into  the  1st  quadrant  as  you  go  from  left  to  right. 
If  m  is  large  and  positive,  it  has  a  steep  incline,  while  if  m  is  small  and  positive,  then  the 
line  has  a  small  angle  of  inclination.  If  m  is  negative,  the  line  goes  into  the  4th  quadrant 
as  you  go  from  left  to  right.  If  m  is  a  large  negative  number  (large  in  absolute  value),  then 
the  line  points  steeply  downward;  while  if  m  is  negative  but  near  zero,  then  it  points  only 
a  little  downward.  These  four  possibilities  are  illustrated  in  figure  1.1.2. 
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Figure  1.1.2     Lines  with  slopes  3,  0.1,  —4,  and  —0.1. 

If  m  =  0,  then  the  line  is  horizontal:  its  equation  is  simply  y  =  b. 

There  is  one  type  of  line  that  cannot  be  written  in  the  form  y  =  mx  +  b,  namely, 
vertical  lines.  A  vertical  line  has  an  equation  of  the  form  x  =  a.  Sometimes  one  says  that 
a  vertical  line  has  an  "infinite"  slope. 

Sometimes  it  is  useful  to  find  the  x-intercept  of  a  line  y  =  mx  +  b.  This  is  the  x-value 
when  y  =  0.  Setting  mx  +  b  equal  to  0  and  solving  for  x  gives:  x  =  —b/m.  For  example, 
the  line  y  =  2x  —  3  through  the  points  A(2, 1)  and  -B(3,  3)  has  x-intercept  3/2. 

EXAMPLE  1.1.2  Suppose  that  you  are  driving  to  Seattle  at  constant  speed,  and  notice 
that  after  you  have  been  traveling  for  1  hour  (i.e.,  t  =  1),  you  pass  a  sign  saying  it  is  110 
miles  to  Seattle,  and  after  driving  another  half-hour  you  pass  a  sign  saying  it  is  85  miles 
to  Seattle.  Using  the  horizontal  axis  for  the  time  t  and  the  vertical  axis  for  the  distance  y 
from  Seattle,  graph  and  find  the  equation  y  =  mt  +  b  for  your  distance  from  Seattle.  Find 
the  slope,  y-intercept,  and  t-intercept,  and  describe  the  practical  meaning  of  each. 

The  graph  of  y  versus  i  is  a  straight  line  because  you  are  traveling  at  constant  speed. 
The  line  passes  through  the  two  points  (1,110)  and  (1.5,85),  so  its  slope  is  m  =  (85  — 
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110)/(1.5  —  1)  =  —50.  The  meaning  of  the  slope  is  that  you  are  travehng  at  50  mph;  m  is 
negative  because  you  are  travehng  toward  Seattle,  i.e.,  your  distance  y  is  decreasing.  The 
word  "velocity"  is  often  used  for  m  =  —50,  when  we  want  to  indicate  direction,  while  the 
word  "speed"  refers  to  the  magnitude  (absolute  value)  of  velocity,  which  is  50  mph.  To 
find  the  equation  of  the  line,  we  use  the  point-slope  formula: 

^^Y~Y'  ^  ^°  ^  "  "^^'^^  -  1)  +  110  =  -50t  +  160. 

The  meaning  of  the  y-intercept  160  is  that  when  t  =  0  (when  you  started  the  trip)  you  were 
160  miles  from  Seattle.  To  find  the  t-intercept,  set  0  =  -50t+160,  so  that  t  =  160/50  =  3.2. 
The  meaning  of  the  t-intercept  is  the  duration  of  your  trip,  from  the  start  until  you  arrive 
in  Seattle.  After  traveling  3  hours  and  12  minutes,  your  distance  y  from  Seattle  will  be  0. 

□ 


Exercises  1.1. 

1.  Find  the  equation  of  the  line  through  (1, 1)  and  (—5,  —3)  in  the  form  y  =  mx  +  6.  =^> 

2.  Find  the  equation  of  the  line  through  (—1,  2)  with  slope  —2  in  the  form  y  =  mx  +  6. 

3.  Find  the  equation  of  the  line  through  (—1, 1)  and  (5,  —3)  in  the  form  y  =  mx  +  6.  =^> 

4.  Change  the  equation  y  —  2a;  =  2  to  the  form  y  =  mx  +  6,  graph  the  hne,  and  find  the 
y- intercept  and  a;-intercept. 

5.  Change  the  equation  x  +  y  =  6  to  the  form  y  =  mx  +  b,  graph  the  line,  and  find  the  y-intercept 
and  x-intercept.  =^ 

6.  Change  the  equation  x  =  2y  —  1  to  the  form  y  =  mx  +  6,  graph  the  hne,  and  find  the 
y-intercept  and  x-intercept.  =^ 

7.  Change  the  equation  3  =  2y  to  the  form  y  =  mx  +  b,  graph  the  hne,  and  find  the  y-intercept 
and  x-intercept.  =^> 

8.  Change  the  equation  2x  -|-  3y  -|-  6  =  0  to  the  form  y  =  mx  +  b,  graph  the  line,  and  find  the 
y-intercept  and  x-intercept.  =^ 

9.  Determine  whether  the  lines  3x  -|-  6y  =  7  and  2x  -|-  4y  =  5  are  parallel. 

10.  Suppose  a  triangle  in  the  x,  y-plane  has  vertices  (—1,  0),  (1,  0)  and  (0,  2).  Find  the  equations 
of  the  three  lines  that  lie  along  the  sides  of  the  triangle  in  y  =  mx  +  b  form.  =^ 

11.  Suppose  that  you  are  driving  to  Seattle  at  constant  speed.  After  you  have  been  traveling 
for  an  hour  you  pass  a  sign  saying  it  is  130  miles  to  Seattle,  and  after  driving  another  20 
minutes  you  pass  a  sign  saying  it  is  105  miles  to  Seattle.  Using  the  horizontal  axis  for  the 
time  t  and  the  vertical  axis  for  the  distance  y  from  your  starting  point,  graph  and  find  the 
equation  y  =  mt  +  b  for  your  distance  from  your  starting  point.  How  long  does  the  trip  to 
Seattle  take? 

12.  Let  X  stand  for  temperature  in  degrees  Celsius  (centigrade),  and  let  y  stand  for  temperature  in 
degrees  Fahrenheit.  A  temperature  of  0°C  corresponds  to  32° F,  and  a  temperature  of  100° C 
corresponds  to  212°F.  Find  the  equation  of  the  line  that  relates  temperature  Fahrenheit  y  to 
temperature  Celsius  x  in  the  form  y  =  mx  +  b.  Graph  the  line,  and  find  the  point  at  which 
this  line  intersects  y  =  x.  What  is  the  practical  meaning  of  this  point?  =^> 
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13.  A  car  rental  firm  has  the  following  charges  for  a  certain  type  of  car:  $25  per  day  with  100 
free  miles  included,  $0.15  per  mile  for  more  than  100  miles.  Suppose  you  want  to  rent  a 
car  for  one  day,  and  you  know  you'll  use  it  for  more  than  100  miles.  What  is  the  equation 
relating  the  cost  y  to  the  number  of  miles  x  that  you  drive  the  car? 

14.  A  photocopy  store  advertises  the  following  prices:  5^  per  copy  for  the  first  20  copies,  per 
copy  for  the  21st  through  100th  copy,  and  3^  per  copy  after  the  100th  copy.  Let  x  be  the 
number  of  copies,  and  let  y  be  the  total  cost  of  photocopying,  (a)  Graph  the  cost  as  x  goes 
from  0  to  200  copies,  (b)  Find  the  equation  in  the  form  y  =  mx  +  b  that  tells  you  the  cost 
of  making  x  copies  when  x  is  more  than  100.  =^ 

15.  In  the  Kingdom  of  Xyg  the  tax  system  works  as  follows.  Someone  who  earns  less  than  100 
gold  coins  per  month  pays  no  tax.  Someone  who  earns  between  100  and  1000  gold  coins 
pays  tax  equal  to  10%  of  the  amount  over  100  gold  coins  that  he  or  she  earns.  Someone 
who  earns  over  1000  gold  coins  must  hand  over  to  the  King  all  of  the  money  earned  over 
1000  in  addition  to  the  tax  on  the  first  1000.  (a)  Draw  a  graph  of  the  tax  paid  y  versus  the 
money  earned  x,  and  give  formulas  for  y  in  terms  of  x  in  each  of  the  regions  0  <  x  <  100, 
100  <  X  <  1000,  and  x  >  1000.  (b)  Suppose  that  the  King  of  Xyg  decides  to  use  the  second 
of  these  line  segments  (for  100  <  x  <  1000)  for  x  <  100  as  well.  Explain  in  practical  terms 
what  the  King  is  doing,  and  what  the  meaning  is  of  the  y-intercept.  =^ 

16.  The  tax  for  a  single  taxpayer  is  described  in  the  figure  1.1.3.  Use  this  information  to  graph 
tax  versus  taxable  income  (i.e.,  x  is  the  amount  on  Form  1040,  line  37,  and  y  is  the  amount  on 
Form  1040,  line  38).  Find  the  slope  and  y- intercept  of  each  line  that  makes  up  the  polygonal 


graph,  up  to  a;  =  97620.  =^> 

1990  Tax  Rate  Schedules 

Schedule  X — Use  if  your  filing  status  is 
Single 

If  the  amount                     Enter  on                   of  the 
on  Form  1040     But  not    Form  1040  amount 
hne  37  is  over:    over:         hne  38  over: 

Schedule  Z — Use  if  your  filing  status  is 
Head  of  household 

If  the  amount                     Enter  on                   of  the 
on  Form  1040     But  not    Form  1040  amount 
hne  37  is  over:    over:         line  38  over: 

$0     $19,450                          15%  $0 
19,450      47,050      $2,917.50+28%  19,450 
47,050      97,620    $10,645.50+33%  47,050 

$0     $26,050                          15%  $0 
26,050      67,200      $3,907.50+28%  26,050 
67,200     134,930    $15,429.50+33%  67,200 

Use  Worksheet 

97,620                     below  to  figure 

your  tax 

Use  Worksheet 

134,930                     below  to  figure 

your  tax 

Figure  1.1.3     Tax  Schedule. 


17.  Market  research  tells  you  that  if  you  set  the  price  of  an  item  at  $1.50,  you  will  be  able  to  sell 
5000  items;  and  for  every  10  cents  you  lower  the  price  below  $1.50  you  will  be  able  to  sell 
another  1000  items.  Let  x  be  the  number  of  items  you  can  sell,  and  let  P  be  the  price  of  an 
item,  (a)  Express  P  linearly  in  terms  of  x,  in  other  words,  express  P  in  the  form  P  =  mx  +  b. 
(b)  Express  x  linearly  in  terms  of  P.  =^ 

18.  An  instructor  gives  a  100-point  final  exam,  and  decides  that  a  score  90  or  above  will  be  a 
grade  of  4.0,  a  score  of  40  or  below  will  be  a  grade  of  0.0,  and  between  40  and  90  the  grading 
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will  be  linear.  Let  x  be  the  exam  score,  and  let  y  be  the  corresponding  grade.  Find  a  formula 
of  the  form  y  =  mx  +  b  which  applies  to  scores  x  between  40  and  90. 

1.2    Distance  Between  Two  Points;  Circles 

Given  two  points  {xi,yi)  and  {x2, 1/2) ,  recall  that  their  horizontal  distance  from  one  another 
is  Ax  =  X2  —  X1  and  their  vertical  distance  from  one  another  is  Ay  =  y2  —  yi-  (Actually,  the 
word  "distance"  normally  denotes  "positive  distance".  Ax  and  Ay  are  signed  distances, 
but  this  is  clear  from  context.)  The  actual  (positive)  distance  from  one  point  to  the  other 
is  the  length  of  the  hypotenuse  of  a  right  triangle  with  legs  |Aa;|  and  \Ay\,  as  shown  in 
figure  1.2.1.  The  Pythagorean  theorem  then  says  that  the  distance  between  the  two  points 
is  the  square  root  of  the  sum  of  the  squares  of  the  horizontal  and  vertical  sides: 

distance  =  a/ (Ax)^  +  {Ay^  =  a/ (x2  -  xi)^  +  (2/2  -  yi)^. 

For  example,  the  distance  between  points  A{2, 1)  and  -8(3,  3)  is  ^^(3  —  2)^  +  (3  —  1)^  = 


Figure  1.2.1     Distance  between  two  points,  Ax  and  Ay  positive. 

As  a  special  case  of  the  distance  formula,  suppose  we  want  to  know  the  distance  of  a 
point  (x,  y)  to  the  origin.  According  to  the  distance  formula,  this  is  \/ {x  —  0)^  +  {y  —  0)^  = 
^/x'^  +  2/2. 

A  point  (x,  y)  is  at  a  distance  r  from  the  origin  if  and  only  if  x'^  +  y"^  =  r,  or,  if  we 
square  both  sides:  x^  +  j/^  =  r^.  This  is  the  equation  of  the  circle  of  radius  r  centered  at 
the  origin.  The  special  case  r  =  1  is  called  the  unit  circle;  its  equation  is  x^  +     =  1. 

Similarly,  if  C{h,  k)  is  any  fixed  point,  then  a  point  {x,y)  is  at  a  distance  r  from  the 
point  C  if  and  only  if  a/(x  —  h)"^  +  {y  —  k)"^  =  r,  i.e.,  if  and  only  if 

(x-h)^  +  {y-k)^  =  r^. 

This  is  the  equation  of  the  circle  of  radius  r  centered  at  the  point  {h,  k).  For  example,  the 

circle  of  radius  5  centered  at  the  point  (0,  —6)  has  equation  (x  —  0)^  +  {y  6)^  =  25,  or 

x2  +  (y  +  6)2  =  25.  If  we  expand  this  we  get  x'^  +  y'^  +  12y  +  36  =  25  or  x"^  +  y'^  +  12y  +  ll  =  0, 
but  the  original  form  is  usually  more  useful. 


22     Chapter  1  Analytic  Geometry 


EXAMPLE  1.2.1  Graph  the  circle  x"^  -  2x  +  y"^  +  4y  -  11  =  0.  With  a  httle  thought 
we  convert  this  to  {x  —  1)^  +  +  2)^  —  16  =  0  or  {x  —  1)^  +  {y  +  2)^  =  16.  Now  we  see 
that  this  is  the  circle  with  radius  4  and  center  (1,-2),  which  is  easy  to  graph.  □ 


Exercises  1.2. 

1.  Find  the  equation  of  the  circle  of  radius  3  centered  at: 

a)  (0,0)  d)  (0,3) 

b)  (5,6)  e)  (0,-3) 

c)  (-5,-6)  f)  (3,0) 

2.  For  each  pair  of  points  A(xi,yi)  and  B{x2,y2)  find  (i)  Ax  and  Ay  in  going  from  A  to  B, 
(ii)  the  slope  of  the  line  joining  A  and  B,  (iii)  the  equation  of  the  line  joining  A  and  B  in 
the  form  y  =  mx  +  b,  (iv)  the  distance  from  A  to  B,  and  (v)  an  equation  of  the  circle  with 
center  at  A  that  goes  through  B. 

a)  A{2,  0),  5(4,  3)  d)  A{-2,  3),  5(4,  3) 

b)  ^(1,  -1),  B{0,  2)  e)  Ai-3,  -2),  5(0,  0) 

c)  ^(0,0),  5(-2,-2)  f)  ^(0.01,-0.01),  5(-0.01,  0.05) 

=^ 

3.  Graph  the  circle  x^  +  y^  +  lOy  =  0. 

4.  Graph  the  circle  x^  -  Wx  +     =  24. 

5.  Graph  the  circle  x'^  —  6x  +  y'^  —  8y  =  0. 

6.  Find  the  standard  equation  of  the  circle  passing  through  (—2, 1)  and  tangent  to  the  line 
3a;  —  2y  =  6  at  the  point  (4,  3).  Sketch.  (Hint:  The  line  through  the  center  of  the  circle  and 
the  point  of  tangency  is  perpendicular  to  the  tangent  line.)  =^ 


1.3  Functions 

A  function  y  =  f{x)  is  a  rule  for  determining  y  when  we're  given  a  value  of  x.  For 
example,  the  rule  y  =  f{x)  =  2a;  +  1  is  a  function.  Any  line  y  =  mx  +  6  is  called  a  linear 
function.  The  graph  of  a  function  looks  like  a  curve  above  (or  below)  the  x-axis,  where 
for  any  value  of  x  the  rule  y  =  f{x)  tells  us  how  far  to  go  above  (or  below)  the  a;-axis  to 
reach  the  curve. 

Functions  can  be  defined  in  various  ways:  by  an  algebraic  formula  or  several  algebraic 
formulas,  by  a  graph,  or  by  an  experimentally  determined  table  of  values.  (In  the  latter 
case,  the  table  gives  a  bunch  of  points  in  the  plane,  which  we  might  then  interpolate  with 
a  smooth  curve,  if  that  makes  sense.) 

Given  a  value  of  x,  a  function  must  give  at  most  one  value  of  y.  Thus,  vertical  lines 
are  not  functions.  For  example,  the  line  x  =  1  has  infinitely  many  values  of  y  if  x  =  1.  It 
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is  also  true  that  if  x  is  any  number  not  1  there  is  no  y  which  corresponds  to  x,  but  that  is 
not  a  problem — only  multiple  y  values  is  a  problem. 

In  addition  to  lines,  another  familiar  example  of  a  function  is  the  parabola  y  =  f(x)  = 
x^.  We  can  draw  the  graph  of  this  function  by  taking  various  values  of  x  (say,  at  regular 
intervals)  and  plotting  the  points  =  Then  connect  the  points  with  a 

smooth  curve.  (See  figure  1.3.1.) 

The  two  examples  y  =  f{x)  =  2x  +  1  and  y  =  f{x)  =  are  both  functions  which 
can  be  evaluated  at  any  value  of  x  from  negative  infinity  to  positive  infinity.  For  many 
functions,  however,  it  only  makes  sense  to  take  x  in  some  interval  or  outside  of  some 
"forbidden"  region.  The  interval  of  x- values  at  which  we're  allowed  to  evaluate  the  function 
is  called  the  domain  of  the  function. 


y  =  fix)  =x^  y  =  fix)  =  y/^  y  =  fix)  =  1/x 

Figure  1.3.1     Some  graphs. 


For  example,  the  square- root  function  y  =  fix)  =  ^Jx  is  the  rule  which  says,  given  an 
X- value,  take  the  nonnegative  number  whose  square  is  x.  This  rule  only  makes  sense  if  x 
is  positive  or  zero.  We  say  that  the  domain  of  this  function  is  x  >  0,  or  more  formally 
{x  G  M  I  X  >  0}.  Alternately,  we  can  use  interval  notation,  and  write  that  the  domain  is 
[0,  oo).  (In  interval  notation,  square  brackets  mean  that  the  endpoint  is  included,  and  a 
parenthesis  means  that  the  endpoint  is  not  included.)  The  fact  that  the  domain  oiy  =  ^x 
is  [0,  cxd)  means  that  in  the  graph  of  this  function  ((see  figure  1.3.1)  we  have  points  (x,  y) 
only  above  x- values  on  the  right  side  of  the  x-axis. 

Another  example  of  a  function  whose  domain  is  not  the  entire  x-axis  is:  y  =  fix)  = 
1/x,  the  reciprocal  function.  We  cannot  substitute  x  =  0  in  this  formula.  The  function 
makes  sense,  however,  for  any  nonzero  x,  so  we  take  the  domain  to  be:  {x  G  M  |  x  7^  0}. 
The  graph  of  this  function  does  not  have  any  point  (x,  y)  with  x  =  0.  As  x  gets  close  to 
0  from  either  side,  the  graph  goes  off  toward  infinity.  We  call  the  vertical  line  x  =  0  an 
asymptote. 

To  summarize,  two  reasons  why  certain  x-values  are  excluded  from  the  domain  of  a 
function  are  that  (i)  we  cannot  divide  by  zero,  and  (ii)  we  cannot  take  the  square  root 
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of  a  negative  number.  We  will  encounter  some  other  ways  in  which  functions  might  be 
undefined  later. 

Another  reason  why  the  domain  of  a  function  might  be  restricted  is  that  in  a  given 
situation  the  x- values  outside  of  some  range  might  have  no  practical  meaning.  For  example, 
if  y  is  the  area  of  a  square  of  side  x,  then  we  can  write  y  =  f{x)  =  .  In  a  purely 
mathematical  context  the  domain  of  the  function  y  =  is  all  of  M.  But  in  the  story- 
problem  context  of  finding  areas  of  squares,  we  restrict  the  domain  to  positive  values  of  x, 
because  a  square  with  negative  or  zero  side  makes  no  sense. 

In  a  problem  in  pure  mathematics,  we  usually  take  the  domain  to  be  all  values  of  x 
at  which  the  formulas  can  be  evaluated.  But  in  a  story  problem  there  might  be  further 
restrictions  on  the  domain  because  only  certain  values  of  x  are  of  interest  or  make  practical 
sense. 

In  a  story  problem,  often  letters  difi^erent  from  x  and  y  are  used.  For  example,  the 
volume  F  of  a  sphere  is  a  function  of  the  radius  r,  given  by  the  formula  V  =  f{r)  =  '^/sTrr^. 
Also,  letters  different  from  /  may  be  used.  For  example,  if  y  is  the  velocity  of  something 
at  time  t,  we  may  write  y  =  v{t)  with  the  letter  v  (instead  of  /)  standing  for  the  velocity 
function  (and  t  playing  the  role  of  x). 

The  letter  playing  the  role  of  x  is  called  the  independent  variable,  and  the  letter 
playing  the  role  of  y  is  called  the  dependent  variable  (because  its  value  "depends  on" 
the  value  of  the  independent  variable).  In  story  problems,  when  one  has  to  translate  from 
English  into  mathematics,  a  crucial  step  is  to  determine  what  letters  stand  for  variables. 
If  only  words  and  no  letters  are  given,  then  we  have  to  decide  which  letters  to  use.  Some 
letters  are  traditional.  For  example,  almost  always,  t  stands  for  time. 

EXAMPLE  1.3.1  An  open-top  box  is  made  from  an  a  x  6  rectangular  piece  of  cardboard 
by  cutting  out  a  square  of  side  x  from  each  of  the  four  corners,  and  then  folding  the  sides 
up  and  sealing  them  with  duct  tape.  Find  a  formula  for  the  volume  V  of  the  box  as  a 
function  of  x,  and  find  the  domain  of  this  function. 

The  box  we  get  will  have  height  x  and  rectangular  base  of  dimensions  a  —  2a;  by  6  —  2x. 
Thus, 

V  =  f{x)  =  x{a  -  2x){b  -  2x). 

Here  a  and  b  are  constants,  and  V  is  the  variable  that  depends  on  x,  i.e.,  V  is  playing  the 
role  of  y. 

This  formula  makes  mathematical  sense  for  any  x,  but  in  the  story  problem  the  domain 
is  much  less.  In  the  first  place,  x  must  be  positive.  In  the  second  place,  it  must  be  less 
than  half  the  length  of  either  of  the  sides  of  the  cardboard.  Thus,  the  domain  is 

1 

{x  e  M  I  0  <  a;  <  -(minimum  of  a  and  b)}. 
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In  interval  notation  we  write:  the  domain  is  the  interval  (0,  min(a,  b)/2).  (You  might  think 
about  whether  we  could  allow  0  or  min(a,  b)/2  to  be  in  the  domain.  They  make  a  certain 
physical  sense,  though  we  normally  would  not  call  the  result  a  box.  If  we  were  to  allow 
these  values,  what  would  the  corresponding  volumes  be?  Does  that  make  sense?)  □ 

EXAMPLE  1.3.2  Circle  of  radius  r  centered  at  the  origin  The  equation  for 
this  circle  is  usually  given  in  the  form  +  =  r^.  To  write  the  equation  in  the  form 
y  =  f{x)  we  solve  for  y,  obtaining  y  =  ±a/ —  x^.  But  this  is  not  a  function,  because 
when  we  substitute  a  value  in  (— r,  r)  for  x  there  are  two  corresponding  values  of  y.  To  get 
a  function,  we  must  choose  one  of  the  two  signs  in  front  of  the  square  root.  If  we  choose 
the  positive  sign,  for  example,  we  get  the  upper  semicircle  y  =  f{x)  =  -\/r^  —  (see 
figure  1.3.2).  The  domain  of  this  function  is  the  interval  [— r,  r],  i.e.,  x  must  be  between  — r 
and  r  (including  the  endpoints).  If  x  is  outside  of  that  interval,  then  —  x'^  is  negative, 
and  we  cannot  take  the  square  root.  In  terms  of  the  graph,  this  just  means  that  there  are 
no  points  on  the  curve  whose  x-coordinate  is  greater  than  r  or  less  than  — r.  □ 


— r  r 


Figure  1.3.2     Upper  semicircle  y  =        —  x'^ 
EXAMPLE  1.3.3    Find  the  domain  of 

v4a;  — 

To  answer  this  question,  we  must  rule  out  the  x- values  that  make  4x  —  x'^  negative  (because 
we  cannot  take  the  square  root  of  a  negative  number)  and  also  the  x-values  that  make 
4x  —  zero  (because  if  4x  —  =  0,  then  when  we  take  the  square  root  we  get  0,  and 
we  cannot  divide  by  0) .  In  other  words,  the  domain  consists  of  all  x  for  which  4x  —  x^  is 
strictly  positive.  We  give  two  different  methods  to  find  out  when  4x  —  x^  >  0. 

First  method.  Factor  4x  (4  —  x).  The  product  of  two  numbers  is  positive 

when  either  both  are  positive  or  both  are  negative,  i.e.,  if  either  x  >  0  and  4  —  x  >  0, 
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or  else  x  <  0  and  4  —  x  <  0.  The  latter  alternative  is  impossible,  since  if  x  is  negative, 
then  4  —  X  is  greater  than  4,  and  so  cannot  be  negative.  As  for  the  first  alternative,  the 
condition  4  —  x  >  0  can  be  rewritten  (adding  x  to  both  sides)  as  4  >  x,  so  we  need:  x  >  0 
and  4  >  X  (this  is  sometimes  combined  in  the  form  4  >  x  >  0,  or,  equivalently,  0  <  x  <  4). 
In  interval  notation,  this  says  that  the  domain  is  the  interval  (0,4). 

Second  method.    Write  4x  —  x^  as  —  (x'^  —  4x),  and  then  complete  the  square, 

obtaining  —  ^(x  —  2)^  ~  ^  =  4  —  (x  —  2)^.  For  this  to  be  positive  we  need  (x  —  2)^  <  4, 
which  means  that  x  —  2  must  be  less  than  2  and  greater  than  —2:  —2  <  x  —  2  <  2.  Adding 
2  to  everything  gives  0  <  x  <  4.  Both  of  these  methods  are  equally  correct;  you  may  use 
either  in  a  problem  of  this  type.  □ 

A  function  does  not  always  have  to  be  given  by  a  single  formula,  as  we  have  already 
seen  (in  the  income  tax  problem,  for  example).  Suppose  that  y  =  v{t)  is  the  velocity 
function  for  a  car  which  starts  out  from  rest  (zero  velocity)  at  time  t  =  0;  then  increases 
its  speed  steadily  to  20  m/sec,  taking  10  seconds  to  do  this;  then  travels  at  constant  speed 
20  m/sec  for  15  seconds;  and  finally  applies  the  brakes  to  decrease  speed  steadily  to  0, 
taking  5  seconds  to  do  this.  The  formula  for  y  =  v{t)  is  different  in  each  of  the  three  time 
intervals:  first  y  =  2x,  then  y  =  20,  then  y  =  —Ax  +  120.  The  graph  of  this  function  is 
shown  in  figure  1.3.3. 


V 
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Figure  1.3.3     A  velocity  function. 

Not  all  functions  are  given  by  formulas  at  all.  A  function  can  be  given  by  an  ex- 
perimentally determined  table  of  values,  or  by  a  description  other  than  a  formula.  For 
example,  the  population  y  of  the  U.S.  is  a  function  of  the  time  t:  we  can  write  y  =  f{t). 
This  is  a  perfectly  good  function — we  could  graph  it  (up  to  the  present)  if  we  had  data  for 
various  t — but  we  can't  find  an  algebraic  formula  for  it. 
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Exercises  1.3. 

Find  the  domain  of  each  of  the  following  functions: 


1. 

11  = 
y 

fix) 

=  \/2x  —  3 

2. 

y  = 

fix) 

=         +  1)  ^ 

3. 

y  = 

fix) 

=  l/{x^  -  1)  = 

4. 

y  = 

fix) 

=  J-l/x  =^ 

V  / 

5. 

y 

fix) 

=  aVt  ^ 

6. 

y  = 

fix) 

=  ^  ^ 

7. 

y  = 

fix) 

=  \/r'^  —  [x  — 

,  where  r  and     are  positive  constants. 

8. 

y  = 

fix) 

=  Vl  -  i^x) 

=^ 

9. 

y  = 

fix) 

=  1/^1  -  (3a: 

10. 

y  = 

fix) 

=  A/a;  +  l/{x  - 

11. 

y  = 

fix) 

=  i/iV^-i) 

12.  Find  the  domain  of  h{x)  =  j  (^^^  -  9)/(a;  -  3)    x  /  3  ^ 

[6  if  X  =  3. 


13.  Suppose  f{x)  =  3x  —  9  and  g{x)  =  \/x.  What  is  the  domain  of  the  composition  {g  o  f){x)7 
(Recall  that  composition  is  defined  as  {g  o  f){x)  =  g{f{x)).)    What  is  the  domain  of 

ifog){x)l  ^ 

14.  A  farmer  wants  to  build  a  fence  along  a  river.  He  has  500  feet  of  fencing  and  wants  to  enclose 
a  rectangular  pen  on  three  sides  (with  the  river  providing  the  fourth  side).  If  x  is  the  length 
of  the  side  perpendicular  to  the  river,  determine  the  area  of  the  pen  as  a  function  of  x.  What 
is  the  domain  of  this  function?  =^ 

15.  A  can  in  the  shape  of  a  cylinder  is  to  be  made  with  a  total  of  100  square  centimeters  of 
material  in  the  side,  top,  and  bottom;  the  manufacturer  wants  the  can  to  hold  the  maximum 
possible  volume.  Write  the  volume  as  a  function  of  the  radius  r  of  the  can;  find  the  domain 
of  the  function.  =^ 

16.  A  can  in  the  shape  of  a  cylinder  is  to  be  made  to  hold  a  volume  of  one  liter  (1000  cubic 
centimeters).  The  manufacturer  wants  to  use  the  least  possible  material  for  the  can.  Write 
the  surface  area  of  the  can  (total  of  the  top,  bottom,  and  side)  as  a  function  of  the  radius  r 
of  the  can;  find  the  domain  of  the  function.  =^ 

1.4    Shifts  and  Dilations 

Many  functions  in  applications  are  built  up  from  simple  functions  by  inserting  constants 
in  various  places.  It  is  important  to  understand  the  effect  such  constants  have  on  the 
appearance  of  the  graph. 

Horizontal  shifts.  //  we  replace  x  by  x  —  C  everywhere  it  occurs  in  the  formula  for  fix), 
then  the  graph  shifts  over  C  to  the  right.  (If  C  is  negative,  then  this  means  that  the  graph 
shifts  over  \C\  to  the  left.)  For  example,  the  graph  ofy  =  (x  — 2)^  is  the  x^-parabola  shifted 
over  to  have  its  vertex  at  the  point  2  on  the  x-axis.  The  graph  of  y  =  ix  +  1)^  is  the  same 
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parabola  shifted  over  to  the  left  so  as  to  have  its  vertex  at  —1  on  the  x-axis.  Note  well: 
when  replacing  a;  by  a;  —  C  we  must  pay  attention  to  meaning,  not  merely  appearance. 
Starting  with  y  =  and  literally  replacing  a;  by  a;  —  2  gives  y  =  x  —  2'^.  This  is  j/  =  a;  —  4, 
a  line  with  slope  1,  not  a  shifted  parabola. 

Vertical  shifts.  If  we  replace  y  by  y  —  D,  then  the  graph  moves  up  D  units.  (If  D  is 
negative,  then  this  means  that  the  graph  moves  down  |D|  units.)  If  the  formula  is  written 
in  the  form  y  =  f{x)  and  if  y  is  replaced  hy  y  —  D  to  get  y  —  D  =  /(x),  we  can  equivalently 
move  D  to  the  other  side  of  the  equation  and  write  y  =  f{x)  +  D.  Thus,  this  principle 
can  be  stated:  to  get  the  graph  of  y  =  f{x)  +  D,  take  the  graph  of  y  =  f{x)  and  move  it 
D  units  up.  For  example,  the  function  y  =  x"^  —  4x  =  —  2)^  —  4  can  be  obtained  from 
y  —  {x  —  2)'^  (see  the  last  paragraph)  by  moving  the  graph  4  units  down.  The  result  is  the 
x^-parabola  shifted  2  units  to  the  right  and  4  units  down  so  as  to  have  its  vertex  at  the 
point  (2,  -4). 

Warning.  Do  not  confuse  f{x)  +  D  and  f{x  +  D).  For  example,  if  f{x)  is  the  function  x^, 
then  f{x)  +  2  is  the  function  x"^  +  2,  while  f{x  +  2)  is  the  function  {x  +  2)^  =     +  4a;  +  4. 

EXAMPLE  1.4.1  Circles  An  important  example  of  the  above  two  principles  starts 
with  the  circle  x'^  +  y^  =  r'^.  This  is  the  circle  of  radius  r  centered  at  the  origin.  (As  we 
saw,  this  is  not  a  single  function  y  =  f{x),  but  rather  two  functions  y  =  ±y/r^^^x^  put 
together;  in  any  case,  the  two  shifting  principles  apply  to  equations  like  this  one  that  are 
not  in  the  form  y  =  f{x).)  If  we  replace  a;  by  a;  —  C  and  replace  y  hy  y  —  D — getting  the 
equation  [x  —  C)'^  +  [y  —  D)'^  =  — the  effect  on  the  circle  is  to  move  it  C  to  the  right 
and  D  up,  thereby  obtaining  the  circle  of  radius  r  centered  at  the  point  {C,D).  This  tells 
us  how  to  write  the  equation  of  any  circle,  not  necessarily  centered  at  the  origin.  □ 

We  will  later  want  to  use  two  more  principles  concerning  the  effects  of  constants  on 
the  appearance  of  the  graph  of  a  function. 

Horizontal  dilation.  If  x  is  replaced  by  x/A  in  a  formula  and  A>  1,  then  the  effect  on 
the  graph  is  to  expand  it  by  a  factor  of  A  in  the  x-direction  (away  from  the  y-axis).  If  A 
is  between  0  and  1  then  the  effect  on  the  graph  is  to  contract  by  a  factor  of  1  /A  (towards 
the  j/-axis).  We  use  the  word  "dilate"  to  mean  expand  or  contract. 

For  example,  replacing  x  by  x/0.5  —  a;/ (1/2)  =  2x  has  the  effect  of  contracting  toward 
the  y-axis  by  a  factor  of  2.  If  A  is  negative,  we  dilate  by  a  factor  of  |^|  and  then  flip 
about  the  y-axis.  Thus,  replacing  x  by  — x  has  the  effect  of  taking  the  mirror  image  of  the 
graph  with  respect  to  the  y-axis.  For  example,  the  function  y  =  \/— a;,  which  has  domain 
{a;  G  M  I  X  <  0},  is  obtained  by  taking  the  graph  of  ^/x  and  flipping  it  around  the  y-axis 
into  the  second  quadrant. 


1.4    Shifts  and  Dilations  29 


Vertical  dilation.  If  y  is  replaced  by  y/B  in  a  formula  and  B  >  0,  then  the  effect  on 
the  graph  is  to  dilate  it  by  a  factor  of  B  in  the  vertical  direction.  As  before,  this  is  an 
expansion  or  contraction  depending  on  wtietlier  B  is  larger  or  smaller  than  one.  Note  that 
if  we  have  a  function  y  =  f{x),  replacing  y  by  y/B  is  equivalent  to  multiplying  the  function 
on  the  right  by  S:  y  =  Bf{x).  The  effect  on  the  graph  is  to  expand  the  picture  away  from 
the  X-axis  by  a  factor  of  S  if  S  >  1,  to  contract  it  toward  the  x-axis  by  a  factor  of  l/S  if 
0  <  S  <  1,  and  to  dilate  by       and  then  flip  about  the  x-axis  if  B  is  negative. 

EXAMPLE  1.4.2  Ellipses  A  basic  example  of  the  two  expansion  principles  is  given 
by  an  ellipse  of  semimajor  axis  a  and  semiminor  axis  b.  We  get  such  an  ellipse  by 
starting  with  the  unit  circle — the  circle  of  radius  1  centered  at  the  origin,  the  equation 
of  which  is  x^  +  =  1 — and  dilating  by  a  factor  of  a  horizontally  and  by  a  factor  of  b 
vertically.  To  get  the  equation  of  the  resulting  ellipse,  which  crosses  the  x-axis  at  ±a  and 
crosses  the  y-axis  at  ±6,  we  replace  x  by  x/a  and  y  by  y/b  in  the  equation  for  the  unit 
circle.  This  gives 


^  +(!)  =1        ;?  +  !j  =  i- 


□ 


Finally,  if  we  want  to  analyze  a  function  that  involves  both  shifts  and  dilations,  it 
is  usually  simplest  to  work  with  the  dilations  first,  and  then  the  shifts.  For  instance,  if 
we  want  to  dilate  a  function  by  a  factor  of  A  in  the  x-direction  and  then  shift  C  to  the 
right,  we  do  this  by  replacing  x  first  by  x/A  and  then  by  (x  —  C)  in  the  formula.  As  an 
example,  suppose  that,  after  dilating  our  unit  circle  by  a  in  the  x-direction  and  by  b  in  the 
y-direction  to  get  the  ellipse  in  the  last  paragraph,  we  then  wanted  to  shift  it  a  distance 
h  to  the  right  and  a  distance  k  upward,  so  as  to  be  centered  at  the  point  (/i,  k).  The  new 
ellipse  would  have  equation 

Note  well  that  this  is  different  than  first  doing  shifts  by  h  and  k  and  then  dilations  by  a 
and  b: 

See  figure  1.4.1. 
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Figure  1.4.1     Elhpses:  +  (^)^  =  1  on  the  left,  (f  -  l)^  +  (|  -  l)^  =  1  on  the 

right. 

Exercises  1.4- 

Starting  with  the  graph  of  y  =  y/x,  the  graph  of  y  —  1/x,  and  the  graph  of  y  =  y/l  —  x^  (the 
upper  unit  semicircle),  sketch  the  graph  of  each  of  the  following  functions: 

-l-l/(x  +  2) 


1. 

=  y/x-2 

2. 

fix) 

3. 

/(^•; 

=  A  +  ^/x  +  2 

4. 

y  = 

5. 

y  = 

f{x)  =  -V^ 

6. 

7. 

fix] 

=  -4+  \/-(x-2) 

8. 

fix) 

9. 

=  l/(x  +  l) 

10. 

fix) 

11. 

=  1  +  1/(2;- 1) 

12. 

fix) 

x/il 


X 


1)^ 


2yr 


(a;/3)2 

4  +  2^1-  (a: -5)2/9 
\/lOO  -  25{x  -1)2  +  2 

The  graph  of  f{x)  is  shown  below.  Sketch  the  graphs  of  the  following  functions. 


13. 

y  =  fix  -  1) 

2  ^ 

14. 

y  =  1  +  f{x  +  2) 

15. 

y  =  1  +  2/(x) 

1  - 

16. 

y  =  2/(3x) 

0- 

17. 

y  =  2/(3(x  -  2))  +  1 

18. 

y  =  (l/2)/(3x  -  3) 

-1  - 

19. 

y  =  /(l  +  x/3)  +  2 

2 

Instantaneous  Rate  of  Change: 
The  Derivative 


2.1    The  slope  of  a  function 

Suppose  that  y  is  a  function  of  x,  say  y  =  f{x).  It  is  often  necessary  to  know  how  sensitive 
the  value  of  y  is  to  small  changes  in  x. 

EXAMPLE  2.1.1  Take,  for  example,  y  ^  f{x)  ^  ^625  -  x^  (the  upper  semicircle  of 
radius  25  centered  at  the  origin).  When  x  =  7,  we  find  that  y  =  V625  -  49  =  24.  Suppose 
we  want  to  know  how  much  y  changes  when  x  increases  a  little,  say  to  7.1  or  7.01. 

In  the  case  of  a  straight  line  y  =  mx  +  b,  the  slope  m  =  Ay/ Ax  measures  the  change  in 
y  per  unit  change  in  x.  This  can  be  interpreted  as  a  measure  of  "sensitivity";  for  example, 
if  y  =  lOOx  +  5,  a  small  change  in  x  corresponds  to  a  change  one  hundred  times  as  large 
in  y,  so  y  is  quite  sensitive  to  changes  in  x. 

Let  us  look  at  the  same  ratio  Ay/ Ax  for  our  function  y  —  f{x)  —  a/ 625  —  when  x 
changes  from  7  to  7.1.  Here  Ax  =  7.1  —  7  =  0.1  is  the  change  in  x,  and 

Ay  =  fix  +  Ax)  -  fix)  =  /(7.1)  -  /(7) 

=  a/625  -  7.12  _  ^625  -  7^  ^  23.9706  -  24  =  -0.0294. 

Thus,  Ay/ Ax  ^  —0.0294/0.1  =  —0.294.  This  means  that  y  changes  by  less  than  one 
third  the  change  in  x,  so  apparently  y  is  not  very  sensitive  to  changes  in  a;  at  a;  =  7. 
We  say  "apparently"  here  because  we  don't  really  know  what  happens  between  7  and  7.1. 
Perhaps  y  changes  dramatically  as  x  runs  through  the  values  from  7  to  7.1,  but  at  7.1  y 
just  happens  to  be  close  to  its  value  at  7.  This  is  not  in  fact  the  case  for  this  particular 
function,  but  we  don't  yet  know  why.  □ 
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One  way  to  interpret  the  above  calculation  is  by  reference  to  a  line.  We  have  computed 
the  slope  of  the  line  through  (7,24)  and  (7.1,23.9706),  called  a  chord  of  the  circle.  In 
general,  if  we  draw  the  chord  from  the  point  (7,  24)  to  a  nearby  point  on  the  semicircle 
(7  +  Ax,  /(7  +  Ax)),  the  slope  of  this  chord  is  the  so-called  difference  quotient 

slope  of  chord  =       ^        '  f^''>  =  -  (7  +  A.)^  -  24^ 

Ax  Ax 

For  example,  if  x  changes  only  from  7  to  7.01,  then  the  difference  quotient  (slope  of  the 
chord)  is  approximately  equal  to  (23.997081  -  24)/0.01  =  -0.2919.  This  is  slightly  less 
steep  than  the  chord  from  (7,24)  to  (7.1,23.9706). 

As  the  second  value  7  +  Ax  moves  in  towards  7,  the  chord  joining  (7,  /(7))  to  (7  + 
Ax,  /(7  + Ax))  shifts  slightly.  As  indicated  in  figure  2.1.1,  as  Ax  gets  smaller  and  smaller, 
the  chord  joining  (7,  24)  to  (7  + Ax,  /(7  + Ax))  gets  closer  and  closer  to  the  tangent  line 
to  the  circle  at  the  point  (7,  24).  (Recall  that  the  tangent  line  is  the  line  that  just  grazes 
the  circle  at  that  point,  i.e.,  it  doesn't  meet  the  circle  at  any  second  point.)  Thus,  as  Ax 
gets  smaller  and  smaller,  the  slope  Ay/ Ax  of  the  chord  gets  closer  and  closer  to  the  slope 
of  the  tangent  line.  This  is  actually  quite  difficult  to  see  when  Ax  is  small,  because  of  the 
scale  of  the  graph.  The  values  of  Ax  used  for  the  figure  are  1,  5,  10  and  15,  not  really  very 
small  values.  The  tangent  line  is^the  one  that  is  uppermost  at  the  right  hand  endpoint. 

25 

20  -  ^^^^^^^^ 

\        \        \  \ 

5       10      15      20  25 

Figure  2.1.1     Chords  approximating  the  tangent  Hne.  (AP) 

So  far  we  have  found  the  slopes  of  two  chords  that  should  be  close  to  the  slope  of 
the  tangent  line,  but  what  is  the  slope  of  the  tangent  line  exactly?  Since  the  tangent  line 
touches  the  circle  at  just  one  point,  we  will  never  be  able  to  calculate  its  slope  directly, 
using  two  "known"  points  on  the  line.  What  we  need  is  a  way  to  capture  what  happens 
to  the  slopes  of  the  chords  as  they  get  "closer  and  closer"  to  the  tangent  line. 
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Instead  of  looking  at  more  particular  values  of  Ax,  let's  see  what  happens  if  we  do 
some  algebra  with  the  difference  quotient  using  just  Ax.  The  slope  of  a  chord  from  (7, 24) 
to  a  nearby  point  is  given  by 

^625  -  (7  +  Aa;)2  -  24  _  ^625  -  (7  +  Ax)^  -  24  v/625^^C^TA^  +  24 
Ax  ~  Ax  ^625  -  (7  +  Ax)2  +  24 

625  -  (7  +  Ax)2  -  242 


Ax(V625  - 

(7  +  Ax)2  +  24) 

49  -  49  - 

■  14Ax  -  Ax^ 

Ax(v/625  - 

(7  + Ax)2  +  24) 

Ax(- 

-14  -  Ax) 

Ax(V625  -  (7  + Ax)2  +  24) 

-11  -  A.r 


^625  -  (7  +  Ax)2  +  24 

Now,  can  we  tell  by  looking  at  this  last  formula  what  happens  when  Ax  gets  very  close  to 
zero?  The  numerator  clearly  gets  very  close  to  —14  while  the  denominator  gets  very  close  to 
\/625  -  72  +  24  =  48.  Is  the  fraction  therefore  very  close  to  -14/48  =  -7/24  ^  -0.29167? 
It  certainly  seems  reasonable,  and  in  fact  it  is  true:  as  Ax  gets  closer  and  closer  to  zero, 
the  difference  quotient  does  in  fact  get  closer  and  closer  to  —7/24,  and  so  the  slope  of  the 
tangent  line  is  exactly  —7/24. 

What  about  the  slope  of  the  tangent  line  at  x  =  12?  Well,  12  can't  be  all  that  different 
from  7;  we  just  have  to  redo  the  calculation  with  12  instead  of  7.  This  won't  be  hard,  but 
it  will  be  a  bit  tedious.  What  if  we  try  to  do  all  the  algebra  without  using  a  specific  value 
for  X?  Let's  copy  from  above,  replacing  7  by  x.  We'll  have  to  do  a  bit  more  than  that — for 
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example,  the  "24"  in  the  calculation  came  from  a/625  -  72,  so  we'll  need  to  fix  that  too. 

^625  -{x  +  Aa;)2  -  V625  -  _ 
Ax  ~ 
_  v/625  -(x  +  Aa;)2  -  V625  -  x'^  ^/625  -  (x  +  Ax)'^  +  V625  -  x^ 
Ax  ^625  -{x  +  Aa;)2  +  V625  -  a;^ 

625  -  (x  +  Ax)2  -  625  +  x^ 

{x  +  Ax)'^  +  \/625  —  x^) 

_  625  -x^  -  2xAx  -  Ax"^  -  625  + 

Ax  (^625  -  —  x^j 

_  Ax{-2x  -  Ax) 

~  Aa;(y625^^^xTA^+  \/625^^) 

—2x  —  Ax 
~  V625  -{x  +  Ax)^  +  \/625  -  x^ 

Now  what  happens  when  Ax  is  very  close  to  zero?  Again  it  seems  apparent  that  the 
quotient  will  be  very  close  to 

—2x  —2x  —X 

\/625  -  a;2  +  ^625  -  x^  ~  2^625  -  x^  ~  ^625  -  x^  ' 
Replacing  x  by  7  gives  —7/24,  as  before,  and  now  we  can  easily  do  the  computation  for  12 
or  any  other  value  of  x  between  —25  and  25. 

So  now  we  have  a  single,  simple  formula,  —x/ a/ 625  —  x'^,  that  tells  us  the  slope  of  the 
tangent  line  for  any  value  of  x.  This  slope,  in  turn,  tells  us  how  sensitive  the  value  of  y  is 
to  changes  in  the  value  of  x. 

What  do  we  call  such  a  formula?  That  is,  a  formula  with  one  variable,  so  that  substi- 
tuting an  "input"  value  for  the  variable  produces  a  new  "output"  value?  This  is  a  function. 
Starting  with  one  function,  ^/625~^—^ ,  we  have  derived,  by  means  of  some  slightly  nasty 
algebra,  a  new  function,  —x/ \/ 625  —  x^,  that  gives  us  important  information  about  the 
original  function.  This  new  function  in  fact  is  called  the  derivative  of  the  original  func- 
tion. If  the  original  is  referred  to  as  /  or  y  then  the  derivative  is  often  written  /'  or  y'  and 
pronounced  "f  prime"  or  "y  prime",  so  in  this  case  we  might  write  f'{x)  —  — a;/\/625^-^. 
At  a  particular  point,  say  x  =  7,  we  say  that  /'(7)  =  —7/24  or  "/  prime  of  7  is  —7/24"  or 
"the  derivative  of  /  at  7  is  -7/24." 

To  summarize,  we  compute  the  derivative  of  f{x)  by  forming  the  difference  quotient 

fix  +  Ax) -fix) 
Ax 

which  is  the  slope  of  a  line,  then  we  figure  out  what  happens  when  Ax  gets  very  close  to 
0. 
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We  should  note  that  in  the  particular  case  of  a  circle,  there's  a  simple  way  to  find  the 
derivative.  Since  the  tangent  to  a  circle  at  a  point  is  perpendicular  to  the  radius  drawn 
to  the  point  of  contact,  its  slope  is  the  negative  reciprocal  of  the  slope  of  the  radius.  The 
radius  joining  (0,  0)  to  (7,  24)  has  slope  24/7.  Hence,  the  tangent  line  has  slope  —7/24.  In 
general,  a  radius  to  the  point  (x,  \/ 625  —  x"^)  has  slope  \/ 625  —  x-^/x,  so  the  slope  of  the 

tangent  line  is  — x/ a/ 625  —  x^,  as  before.  It  is  NOT  always  true  that  a  tangent  line  is 
perpendicular  to  a  line  from  the  origin — don't  use  this  shortcut  in  any  other  circumstance. 

As  above,  and  as  you  might  expect,  for  different  values  of  x  we  generally  get  different 
values  of  the  derivative  /'(x).  Could  it  be  that  the  derivative  always  has  the  same  value? 
This  would  mean  that  the  slope  of  /,  or  the  slope  of  its  tangent  line,  is  the  same  everywhere. 
One  curve  that  always  has  the  same  slope  is  a  line;  it  seems  odd  to  talk  about  the  tangent 
line  to  a  line,  but  if  it  makes  sense  at  all  the  tangent  line  must  be  the  line  itself.  It  is  not 
hard  to  see  that  the  derivative  of  /(x)  =  mx  +  6  is  /'(x)  =  m;  see  exercise  6. 

Exercises  2.1. 

1.  Draw  the  graph  of  the  function  y  =  f{x)  =  \/ 169  —  between  a;  =  0  and  x  =  13.  Find  the 
slope  Ay/Ax  of  the  chord  between  the  points  of  the  circle  lying  over  (a)  x  =  12  and  x  =  13, 
(b)  X  =  12  and  a;  =  12.1,  (c)  x  =  12  and  x  =  12.01,  (d)  x  =  12  and  x  =  12.001.  Now  use 
the  geometry  of  tangent  lines  on  a  circle  to  find  (e)  the  exact  value  of  the  derivative  /'(12). 
Your  answers  to  (a)-(d)  should  be  getting  closer  and  closer  to  your  answer  to  (e). 

2.  Use  geometry  to  find  the  derivative  /'(x)  of  the  function  /(x)  =  \/ 625  —  x"^  in  the  text  for 
each  of  the  following  x:  (a)  20,  (b)  24,  (c)  —7,  (d)  —15.  Draw  a  graph  of  the  upper  semicircle, 
and  draw  the  tangent  line  at  each  of  these  four  points.  =^> 

3.  Draw  the  graph  of  the  function  y  =  /(x)  =  1/x  between  x  =  1/2  and  x  =  4.  Find  the  slope 
of  the  chord  between  (a)  x  =  3  and  x  =  3.1,  (b)  x  =  3  and  x  =  3.01,  (c)  x  =  3  and  x  =  3.001. 
Now  use  algebra  to  find  a  simple  formula  for  the  slope  of  the  chord  between  (3,  /(3))  and 
(3  +  Ax,/(3  +  Ax)).  Determine  what  happens  when  Ax  approaches  0.  In  your  graph  of 
y  =  1/x,  draw  the  straight  line  through  the  point  (3, 1/3)  whose  slope  is  this  limiting  value 
of  the  difi^erence  quotient  as  Ax  approaches  0.  =^ 

4.  Find  an  algebraic  expression  for  the  difi^erence  quotient  ^/(l  +  Ax)  —  /(l)^ /Ax  when  /(x)  = 

x^  —  (1/x).  Simplify  the  expression  as  much  as  possible.  Then  determine  what  happens  as 
Ax  approaches  0.  That  value  is  /'(I).  =^> 

5.  Draw  the  graph  of  y  =  /(x)  =  x^  between  x  =  0  and  x  =  1.5.  Find  the  slope  of  the  chord 
between  (a)  x  =  1  and  x  =  1.1,  (b)  x  =  1  and  x  =  1.001,  (c)  x  =  1  and  x  =  1.00001. 
Then  use  algebra  to  find  a  simple  formula  for  the  slope  of  the  chord  between  1  and  1  +  Ax. 
(Use  the  expansion  {A  +  By^  =  +  3A  B  +  3AB  +  B-\)  Determine  what  happens  as  Ax 
approaches  0,  and  in  your  graph  of  y  =  x^  draw  the  straight  line  through  the  point  (1,1) 
whose  slope  is  equal  to  the  value  you  just  found.  =^> 

6.  Find  an  algebraic  expression  for  the  difi^erence  quotient  (/(x  +  Ax)  —  /(x))/Ax  when  /(x)  = 
mx  +  h.  Simplify  the  expression  as  much  as  possible.  Then  determine  what  happens  as  Ax 
approaches  0.  That  value  is  f'{x).  =^ 
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7.  Sketch  the  unit  circle.  Discuss  the  behavior  of  the  slope  of  the  tangent  line  at  various  angles 
around  the  circle.  Which  trigonometric  function  gives  the  slope  of  the  tangent  line  at  an 
angle  9?  Why?  Hint:  think  in  terms  of  ratios  of  sides  of  triangles. 

8.  Sketch  the  parabola  y  —  .  For  what  values  of  x  on  the  parabola  is  the  slope  of  the  tangent 
line  positive?  Negative?  What  do  you  notice  about  the  graph  at  the  point  (s)  where  the  sign 
of  the  slope  changes  from  positive  to  negative  and  vice  versa? 

2.2    An  example 

We  started  the  last  section  by  saying,  "It  is  often  necessary  to  know  how  sensitive  the 
value  of  y  is  to  small  changes  in  x."  We  have  seen  one  purely  mathematical  example  of 
this:  finding  the  "steepness"  of  a  curve  at  a  point  is  precisely  this  problem.  Here  is  a  more 
applied  example. 

With  careful  measurement  it  might  be  possible  to  discover  that  a  dropped  ball  has 
height  h(t)  =  ho  —  kt^,  t  seconds  after  it  is  released.  (Here  ho  is  the  initial  height  of  the  ball, 
when  t  =  0,  and  k  is  some  number  determined  by  the  experiment.)  A  natural  question  is 
then,  "How  fast  is  the  ball  going  at  time  t?"  We  can  certainly  get  a  pretty  good  idea  with  a 
little  simple  arithmetic.  To  make  the  calculation  more  concrete,  let's  say  ho  =  100  meters 
and  k  —  4.9  and  suppose  we're  interested  in  the  speed  at  t  —  2.  We  know  that  when  t  —  2 
the  height  is  100  -  4  ■  4.9  =  80.4.  A  second  later,  at  t  =  3,  the  height  is  100  -  9  ■  4.9  =  55.9, 
so  in  that  second  the  ball  has  traveled  80.4  —  55.9  =  24.5  meters.  This  means  that  the 
average  speed  during  that  time  was  24.5  meters  per  second.  So  we  might  guess  that  24.5 
meters  per  second  is  not  a  terrible  estimate  of  the  speed  at  t  =  2.  But  certainly  we  can 
do  better.  At  t  —  2.5  the  height  is  100  —  4.9(2.5)^  =  69.375.  During  the  half  second  from 
t  =  2  to  t  =  2.5  the  ball  dropped  80.4  —  69.375  =  11.025  meters,  at  an  average  speed  of 
11.025/(1/2)  =  22.05  meters  per  second;  this  should  be  a  better  estimate  of  the  speed  at 
t  =  2.  So  it's  clear  now  how  to  get  better  and  better  approximations:  compute  average 
speeds  over  shorter  and  shorter  time  intervals.  Between  t  =  2  and  t  =  2.01,  for  example, 
the  ball  drops  0.19649  meters  in  one  hundredth  of  a  second,  at  an  average  speed  of  19.649 
meters  per  second. 

We  can't  do  this  forever,  and  we  still  might  reasonably  ask  what  the  actual  speed 
precisely  at  t  =  2  is.  If  At  is  some  tiny  amount  of  time,  what  we  want  to  know  is  what 
happens  to  the  average  speed  {h{2)  —  h{2  +  At))/At  as  At  gets  smaller  and  smaller.  Doing 
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a  bit  of  algebra: 

h{2)  -  h{2  +  At)  _  80.4  -  (100  -  4.9(2  +  At)^) 
At  ~  At 

_  80.4  -  100  +  19.6  +  19.6At  +  4.9 At'^ 
~  At 
_  19.6At  +  4.9At2 
~  At 
=  19.6  +  4.9At 

When  At  is  very  small,  this  is  very  close  to  19.6,  and  indeed  it  seems  clear  that  as  At 
goes  to  zero,  the  average  speed  goes  to  19.6,  so  the  exact  speed  at  t  =  2  is  19.6  meters  per 
second.  This  calculation  should  look  very  familiar.  In  the  language  of  the  previous  section, 
we  might  have  started  with  f{x)  =  100  —  4.9x^  and  asked  for  the  slope  of  the  tangent  line 
at  a;  =  2.  We  would  have  answered  that  question  by  computing 

/(2  +  A.)-/(2)  ^  -19.6A.-4.9A.^  ^        ^  _  ^^^^ 
Ax  Ax 

The  algebra  is  the  same,  except  that  following  the  pattern  of  the  previous  section  the 
subtraction  would  be  reversed,  and  we  would  say  that  the  slope  of  the  tangent  line  is 
—19.6.  Indeed,  in  hindsight,  perhaps  we  should  have  subtracted  the  other  way  even  for 
the  dropping  ball.  At  t  =  2  the  height  is  80.4;  one  second  later  the  height  is  55.9.  The 
usual  way  to  compute  a  "distance  traveled"  is  to  subtract  the  earlier  position  from  the 
later  one,  or  55.9  —  80.4  =  —24.5.  This  tells  us  that  the  distance  traveled  is  24.5  meters, 
and  the  negative  sign  tells  us  that  the  height  went  down  during  the  second.  If  we  continue 
the  original  calculation  we  then  get  —19.6  meters  per  second  as  the  exact  speed  at  t  =  2. 
If  we  interpret  the  negative  sign  as  meaning  that  the  motion  is  downward,  which  seems 
reasonable,  then  in  fact  this  is  the  same  answer  as  before,  but  with  even  more  information, 
since  the  numerical  answer  contains  the  direction  of  motion  as  well  as  the  speed.  Thus, 
the  speed  of  the  ball  is  the  value  of  the  derivative  of  a  certain  function,  namely,  of  the 
function  that  gives  the  position  of  the  ball.  (More  properly,  this  is  the  velocity  of  the  ball; 
velocity  is  signed  speed,  that  is,  speed  with  a  direction  indicated  by  the  sign.) 

The  upshot  is  that  this  problem,  finding  the  speed  of  the  ball,  is  exactly  the  same 
problem  mathematically  as  finding  the  slope  of  a  curve.  This  may  already  be  enough 
evidence  to  convince  you  that  whenever  some  quantity  is  changing  (the  height  of  a  curve 
or  the  height  of  a  ball  or  the  size  of  the  economy  or  the  distance  of  a  space  probe  from 
earth  or  the  population  of  the  world)  the  rate  at  which  the  quantity  is  changing  can,  in 
principle,  be  computed  in  exactly  the  same  way,  by  finding  a  derivative. 
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Exercises  2.2. 

1.  An  object  is  traveling  in  a  straight  line  so  that  its  position  (that  is,  distance  from  some  fixed 
point)  is  given  by  this  table: 


time  (seconds) 

0 

1 

2 

3 

distance  (meters) 

0 

10 

25 

60 

Find  the  average  speed  of  the  object  during  the  following  time  intervals:  [0, 1],  [0,2],  [0,3], 
[1,2],  [1,3],  [2,3].  If  you  had  to  guess  the  speed  at  t  =  2  just  on  the  basis  of  these,  what 
would  you  guess?  =^> 

2.  Let  y  =  f{t)  =  t'^ ,  where  t  is  the  time  in  seconds  and  y  is  the  distance  in  meters  that  an 
object  falls  on  a  certain  airless  planet.  Draw  a  graph  of  this  function  between  t  =  0  and 
t  =  3.  Make  a  table  of  the  average  speed  of  the  falling  object  between  (a)  2  sec  and  3  sec, 
(b)  2  sec  and  2.1  sec,  (c)  2  sec  and  2.01  sec,  (d)  2  sec  and  2.001  sec.  Then  use  algebra  to  find 
a  simple  formula  for  the  average  speed  between  time  2  and  time  2  +  At.  (If  you  substitute 
At  =  1,  0.1,  0.01,  0.001  in  this  formula  you  should  again  get  the  answers  to  parts  (a)-(d).) 
Next,  in  your  formula  for  average  speed  (which  should  be  in  simplified  form)  determine  what 
happens  as  At  approaches  zero.  This  is  the  instantaneous  speed.  Finally,  in  your  graph 
oi  y  =  draw  the  straight  line  through  the  point  (2,4)  whose  slope  is  the  instantaneous 
velocity  you  just  computed;  it  should  of  course  be  the  tangent  line.  =^ 

3.  If  an  object  is  dropped  from  an  80-meter  high  window,  its  height  y  above  the  ground  at  time 
t  seconds  is  given  by  the  formula  y  =  f{t)  =  80-4.9t  .  (Here  we  are  neglecting  air  resistance; 
the  graph  of  this  function  was  shown  in  figure  1.0.1.)  Find  the  average  velocity  of  the  falling 
object  between  (a)  1  sec  and  1.1  sec,  (b)  1  sec  and  1.01  sec,  (c)  1  sec  and  1.001  sec.  Now  use 
algebra  to  find  a  simple  formula  for  the  average  velocity  of  the  falling  object  between  1  sec 
and  1  +  At  sec.  Determine  what  happens  to  this  average  velocity  as  At  approaches  0.  That 
is  the  instantaneous  velocity  at  time  t  =  1  second  (it  will  be  negative,  because  the  object  is 
falling). 

2.3  Limits 

In  the  previous  two  sections  we  computed  some  quantities  of  interest  (slope,  velocity)  by 
seeing  that  some  expression  "goes  to"  or  "approaches"  or  "gets  really  close  to"  a  particular 
value.  In  the  examples  we  saw,  this  idea  may  have  been  clear  enough,  but  it  is  too  fuzzy 
to  rely  on  in  more  difficult  circumstances.  In  this  section  we  will  see  how  to  make  the  idea 
more  precise. 

There  is  an  important  feature  of  the  examples  we  have  seen.  Consider  again  the 
formula 

-19.6Ax  -  4.9Ax2 
Ax 

We  wanted  to  know  what  happens  to  this  fraction  as  "Ax  goes  to  zero."  Because  we  were 
able  to  simplify  the  fraction,  it  was  easy  to  see  the  answer,  but  it  was  not  quite  as  simple 
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as  "substituting  zero  for  A  X  ^    ctS  that  would  give 

-19.6- 0-4.9-0 


0 

which  is  meaningless.  The  quantity  we  are  really  interested  in  does  not  make  sense  "at 
zero,"  and  this  is  why  the  answer  to  the  original  problem  (finding  a  velocity  or  a  slope) 
was  not  immediately  obvious.  In  other  words,  we  are  generally  going  to  want  to  figure 
out  what  a  quantity  "approaches"  in  situations  where  we  can't  merely  plug  in  a  value.  If 
you  would  like  to  think  about  a  hard  example  (which  we  will  analyze  later)  consider  what 
happens  to  (sina;)/a;        approaches  zero. 

EXAMPLE  2.3.1  Does  s/x  approach  1.41  as  x  approaches  2?  In  this  case  it  is  possible 
to  compute  the  actual  value  -\/2  to  a  high  precision  to  answer  the  question.  But  since 
in  general  we  won't  be  able  to  do  that,  let's  not.  We  might  start  by  computing  y/x  for 
values  of  x  close  to  2,  as  we  did  in  the  previous  sections.  Here  are  some  values:  V2.05  = 
1.431782106,  V2M  =  1.428285686,  V2m  =  1.424780685,  V2m  =  1.421267040,  V2M  = 
1.417744688,  V2.005  =  1.415980226,  V2.004  =  1.415627070,  V2.003  =  1.415273825, 
V2.002  =  1.414920492,  V2.001  =  1.414567072.  So  it  looks  at  least  possible  that  indeed 
these  values  "approach"  1.41 — already  -\/2.001  is  quite  close.  If  we  continue  this  process, 
however,  at  some  point  we  will  appear  to  "stall."  In  fact,  V2  =  1.414213562 . . .,  so  we  will 
never  even  get  as  far  as  1.4142,  no  matter  how  long  we  continue  the  sequence.  □ 

So  in  a  fuzzy,  everyday  sort  of  sense,  it  is  true  that  y/x  "gets  close  to"  1.41,  but  it 
does  not  "approach"  1.41  in  the  sense  we  want.  To  compute  an  exact  slope  or  an  exact 
velocity,  what  we  want  to  know  is  that  a  given  quantity  becomes  "arbitrarily  close"  to  a 
fixed  value,  meaning  that  the  first  quantity  can  be  made  "as  close  as  we  like"  to  the  fixed 
value.  Consider  again  the  quantities 

-19.6Aa;  -  4.9Ax^         ^  ^     ,  ^  , 

 =  -19.6  -  4.9Aa;. 

Ax 

These  two  quantities  are  equal  as  long  as  Ax  is  not  zero;  if  Ax  is  zero,  the  left  hand 
quantity  is  meaningless,  while  the  right  hand  one  is  —19.6.  Can  we  say  more  than  we 
did  before  about  why  the  right  hand  side  "approaches"  —19.6,  in  the  desired  sense?  Can 
we  really  make  it  "as  close  as  we  want"  to  —19.6?  Let's  try  a  test  case.  Can  we  make 

—  19.6  —  4.9Aa;  within  one  millionth  (0.000001)  of  —19.6?  The  values  within  a  millionth  of 

—  19.6  are  those  in  the  interval  (-19.600001,-19.599999).  As  Ax  approaches  zero,  does 

—  19.6  —  4.9 Ax  eventually  reside  inside  this  interval?  If  Aa;  is  positive,  this  would  require 
that  —19.6  —  4.9 Ax  >  —19.600001.  This  is  something  we  can  manipulate  with  a  little 
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algebra: 

-19.6  -  A.9Ax  >  -19.600001 
-4:.9Ax  >  -0.000001 

Ax  <  -0.000001/  -4.9 

Ax  <  0.0000002040816327. . . 

Thus,  we  can  say  with  certainty  that  if  Ax  is  positive  and  less  than  0.0000002,  then 
Ax  <  0.0000002040816327 ...  and  so  -19.6  -  4.9Aa;  >  -19.600001.  We  could  do  a  similar 
calculation  if  Ax  is  negative. 

So  now  we  know  that  we  can  make  —19.6  —  4. 9 Ax  within  one  millionth  of  —19.6.  But 
can  we  make  it  "as  close  as  we  want"  ?  In  this  case,  it  is  quite  simple  to  see  that  the  answer 
is  yes,  by  modifying  the  calculation  we've  just  done.  It  may  be  helpful  to  think  of  this  as 
a  game.  I  claim  that  I  can  make  —19.6  —  4.9 Ax  as  close  as  you  desire  to  —19.6  by  making 
Ax  "close  enough"  to  zero.  So  the  game  is:  you  give  me  a  number,  like  10~^,  and  I  have 
to  come  up  with  a  number  representing  how  close  Ax  must  be  to  zero  to  guarantee  that 
—19.6  —  4.9Aa;  is  at  least  as  close  to  —19.6  as  you  have  requested. 

Now  if  we  actually  play  this  game,  I  could  redo  the  calculation  above  for  each  new 
number  you  provide.  What  I'd  like  to  do  is  somehow  see  that  I  will  always  succeed,  and 
even  more,  I'd  like  to  have  a  simple  strategy  so  that  I  don't  have  to  do  all  that  algebra 
every  time.  A  strategy  in  this  case  would  be  a  formula  that  gives  me  a  correct  answer  no 
matter  what  you  specify.  So  suppose  the  number  you  give  me  is  e.  How  close  does  Ax 
have  to  be  to  zero  to  guarantee  that  —19.6  —  4.9Aa;  is  in  (—19.6  —  e,  —19.6  +  e)?  If  Ax  is 
positive,  we  need: 

-19.6-4.9Aa;  >  -19.6  -  e 
-4.9Aa;  >  -e 

Aa;  <  -e/  -4.9 
Aa;  <  e/4.9 

So  if  I  pick  any  number  S  that  is  less  than  e/4.9,  the  algebra  tells  me  that  whenever  Ax  <  S 
then  Ax  <  e/4.9  and  so  —19.6  —  4.9Aa;  is  within  e  of  —19.6.  (This  is  exactly  what  I  did 
in  the  example:  I  picked  6  =  0.0000002  <  0.0000002040816327 . . ..)  A  similar  calculation 
again  works  for  negative  Ax.  The  important  fact  is  that  this  is  now  a  completely  general 
result — it  shows  that  I  can  always  win,  no  matter  what  "move"  you  make. 

Now  we  can  codify  this  by  giving  a  precise  definition  to  replace  the  fuzzy,  "gets  closer 
and  closer"  language  we  have  used  so  far.  Henceforward,  we  will  say  something  like  "the 
limit  of  (— 19.6Aa;— 4.9Aa;^)/Aa;  as  Aa;  goes  to  zero  is  —19.6,"  and  abbreviate  this  mouthful 
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as 

-19.6Aa; -4.9Aa;2 
hm  =  -19.6. 

Ax^O  Ax 

Here  is  the  actual,  official  definition  of  "limit" . 

DEFINITION  2.3.2  Limit  Suppose  /  is  a  function.  We  say  that  Hm  f{x)  =  L  if 
for  every  e  >  0  there  is  a  5  >  0  so  that  whenever  0  <  \x  —  a\  <  5,  \f{x)  —  L\  <  e.  □ 

The  e  and  5  here  play  exactly  the  role  they  did  in  the  preceding  discussion.  The 
definition  says,  in  a  very  precise  way,  that  f{x)  can  be  made  as  close  as  desired  to  L 
(that's  the  \f{x)  —  L\  <  e  part)  by  making  x  close  enough  to  a  (the  0  <  |a;  —  a|  <  5 
part).  Note  that  we  specifically  make  no  mention  of  what  must  happen  if  x  =  a,  that  is, 
if  \x  —  a\  =  0.  This  is  because  in  the  cases  we  are  most  interested  in,  substituting  a  for  x 
doesn't  even  make  sense. 

Make  sure  you  are  not  confused  by  the  names  of  important  quantities.  The  generic 
definition  talks  about  /(x),  but  the  function  and  the  variable  might  have  other  names.  In 
the  discussion  above,  the  function  we  analyzed  was 

-19.6Aa;  -  4:.9Ax^ 


Ax 

and  the  variable  of  the  limit  was  not  x  but  Ax.  The  x  was  the  variable  of  the  original 
function;  when  we  were  trying  to  compute  a  slope  or  a  velocity,  x  was  essentially  a  fixed 
quantity,  telling  us  at  what  point  we  wanted  the  slope.  (In  the  velocity  problem,  it  was 
literally  a  fixed  quantity,  as  we  focused  on  the  time  2.)  The  quantity  a  of  the  definition 
in  all  the  examples  was  zero:  we  were  always  interested  in  what  happened  as  Ax  became 
very  close  to  zero. 

Armed  with  a  precise  definition,  we  can  now  prove  that  certain  quantities  behave  in  a 
particular  way.  The  bad  news  is  that  even  proofs  for  simple  quantities  can  be  quite  tedious 
and  complicated;  the  good  news  is  that  we  rarely  need  to  do  such  proofs,  because  most 
expressions  act  the  way  you  would  expect,  and  this  can  be  proved  once  and  for  all. 


EXAMPLE  2.3.3     Let's  show  carefully  that  lim  x  +  4  —  6.  This  is  not  something  we 

"need"  to  prove,  since  it  is  "obviously"  true.  But  if  we  couldn't  prove  it  using  our  official 
definition  there  would  be  something  very  wrong  with  the  definition. 

As  is  often  the  case  in  mathematical  proofs,  it  helps  to  work  backwards.  We  want  to 
end  up  showing  that  under  certain  circumstances  a;  +  4  is  close  to  6;  precisely,  we  want  to 
show  that  |x  +  4  —  6|  <  e,  or  |x  —  2|  <  e.  Under  what  circumstances?  We  want  this  to  be 
true  whenever  0  <  |a;  —  2|  <  5.  So  the  question  becomes:  can  we  choose  a  value  for  6  that 
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guarantees  that  0  <  |a;  —  2|  <  5  implies  |a;  —  2|  <  e?  Of  course:  no  matter  what  e  is,  5  =  e 
works.  □ 

So  it  turns  out  to  be  very  easy  to  prove  something  "obvious,"  which  is  nice.  It  doesn't 
take  long  before  things  get  trickier,  however. 

EXAMPLE  2.3.4    It  seems  clear  that  lim      =  4.  Let's  try  to  prove  it.  We  will  want 

a;->2 

to  be  able  to  show  that  \x'^  —  4|  <  e  whenever  0  <  |a;  —  2|  <  5,  by  choosing  5  carefully.  Is 
there  any  connection  between  \x  —  2|  and  \x^  —  4|?  Yes,  and  it's  not  hard  to  spot,  but  it  is 
not  so  simple  as  the  previous  example.  We  can  write  |a;^  —  4|  =  |  (a;  +  2) (a;  —  2) | .  Now  when 
|a;  — 2|  is  small,  part  of  |(a;  +  2)(a;  — 2)|  is  small,  namely  {x  —  2).  What  about  {x  +  2)?  If  x  is 
close  to  2,  (a;  +  2)  certainly  can't  be  too  big,  but  we  need  to  somehow  be  precise  about  it. 
Let's  recall  the  "game"  version  of  what  is  going  on  here.  You  get  to  pick  an  e  and  I  have  to 
pick  a  5  that  makes  things  work  out.  Presumably  it  is  the  really  tiny  values  of  e  I  need  to 
worry  about,  but  I  have  to  be  prepared  for  anything,  even  an  apparently  "bad"  move  like 
e  =  1000.  I  expect  that  e  is  going  to  be  small,  and  that  the  corresponding  6  will  be  small, 
certainly  less  than  1.  If  5  <  1  then  |x  +  2|  <  5  when  —  2|  <  6  (because  if  x  is  within  1  of 
2,  then  x  is  between  1  and  3  and  x  +  2  is  between  3  and  5).  So  then  I'd  be  trying  to  show 
that  \{x  +  2){x  —  2)1  <  5\x  —  2|  <  e.  So  now  how  can  I  pick  6  so  that  |x  —  2|  <  5  implies 
5\x  —  2|  <  e?  This  is  easy:  use  d  —  e/5,  so  5\x  —  2\  <  5(e/5)  =  e.  But  what  if  the  e  you 
choose  is  not  small?  If  you  choose  e  =  1000,  should  I  pick  5  =  200?  No,  to  keep  things 
"sane"  I  will  never  pick  a  5  bigger  than  1.  Here's  the  final  "game  strategy:"  When  you 
pick  a  value  for  e  I  will  pick  S  =  e/5  or  S  =  1,  whichever  is  smaller.  Now  when  —  2|  <  S, 
I  know  both  that  |a;  +  2|  <  5  and  that  |a;  -  2|  <  e/5.  Thus  |(a;  +  2)(a;  -  2)|  <  5(e/5)  =  e. 

This  has  been  a  long  discussion,  but  most  of  it  was  explanation  and  scratch  work.  If 
this  were  written  down  as  a  proof,  it  would  be  quite  short,  like  this: 

Proof  that  lim  x"^  =  4.  Given  any  e,  pick  5  =  e/5  or  5  =  1,  whichever  is  smaller.  Then 

when  |a;-2|  <  6,  \x  +  2\  <  5  and  |a;-2|  <  e/5.  Hence  -4|  =  |(a;  +  2)(a;-2)|  <  5(e/5)  = 
e.  □ 


It  probably  seems  obvious  that  lim  x    =4,  and  it  is  worth  examining  more  closely 

why  it  seems  obvious.  If  we  write  x'^  =  x  ■  x,  and  ask  what  happens  when  x  approaches  2, 
we  might  say  something  like,  "Well,  the  first  x  approaches  2,  and  the  second  x  approaches 
2,  so  the  product  must  approach  2  ■  2."  In  fact  this  is  pretty  much  right  on  the  money, 
except  for  that  word  "must."  Is  it  really  true  that  if  x  approaches  a  and  y  approaches  b 
then  xy  approaches  abl  It  is,  but  it  is  not  really  obvious,  since  x  and  y  might  be  quite 
complicated.  The  good  news  is  that  we  can  see  that  this  is  true  once  and  for  all,  and  then 


2.3    Limits  43 


we  don't  have  to  worry  about  it  ever  again.  When  we  say  that  x  might  be  "comphcated" 
we  really  mean  that  in  practice  it  might  be  a  function.  Here  is  then  what  we  want  to  know: 

THEOREM  2.3.5    Suppose  lim  f(x)  =  L  and  lim  g(x)  =  M.  Then 

x—)-a  x—^a 

lima;^a  f{x)g{x)  =  LM. 

Proof.    We  have  to  use  the  official  definition  of  limit  to  make  sense  of  this.  So  given  any 
e  we  need  to  find  a  5  so  that  0  <  \x  —  a\  <  5  implies  \  f{x)g{x)  —  LM\  <  e.  What  do  we 
have  to  work  with?  We  know  that  we  can  make  f{x)  close  to  L  and  g{x)  close  to  M,  and 
we  have  to  somehow  connect  these  facts  to  make  f{x)g{x)  close  to  LM. 
We  use,  as  is  so  often  the  case,  a  little  algebraic  trick: 

\f{x)g{x)  -  LM\  =  \fix)gix)  -  f{x)M  +  f{x)M  -  LM\ 

=  \f{x){g{x)-M)  +  {f{x)-L)M\ 

<  \f{x){g{x)-M)\  +  \{f{x)-L)M\ 

=  \f{x)\\g{x)-M\  +  \f{x)-L\\M\. 

This  is  all  straightforward  except  perhaps  for  the  "<".  That  is  an  example  of  the  triangle 
inequality,  which  says  that  if  a  and  h  are  any  real  numbers  then  |o  +  6|  <  |a|  +       If  you 
look  at  a  few  examples,  using  positive  and  negative  numbers  in  various  combinations  for 
a  and  &,  you  should  quickly  understand  why  this  is  true;  we  will  not  prove  it  formally. 
Since  lim  f{x)  —  L,  there  is  a  value  di  so  that  0  <  |x  —  a|  <  5i  implies  \  f{x)  —  L\  < 

|e/(2M)|,  This  means  that  0  <  |a;  —  a|  <  5i  implies  \f{x)  —  L\\M\  <  e/2.  You  can  see 
where  this  is  going:  if  we  can  make  \f{x)\\g{x)  —  M\  <  e/2  also,  then  we'll  be  done. 

We  can  make  \g{x)  —  M\  smaller  than  any  fixed  number  by  making  x  close  enough 
to  a;  unfortunately,  e/(2/(x))  is  not  a  fixed  number,  since  x  is  a  variable.  Here  we  need 
another  little  trick,  just  like  the  one  we  used  in  analyzing  x'^.  We  can  find  a  62  so  that 
|a;  —  a|  <  62  implies  that  \  f{x)  —  L|  <  1,  meaning  that  L  —  1  <  f{x)  <  L  +  1.  This  means 
that  <  N,  where  N  is  either     —  1|  or  |L  +  1|,  depending  on  whether  L  is  negative 

or  positive.  The  important  point  is  that  N  doesn't  depend  on  x.  Finally,  we  know  that 
there  is  a  ^3  so  that  0  <  |a;  —  a|  <  ^3  implies  \g{x)  —  M\  <  e/{2N).  Now  we're  ready  to 
put  everything  together.  Let  5  be  the  smallest  of  5i,  52,  and  ^3.  Then  \x  —  a\  <  5  implies 
that  |/(a;)  -  L|  <  |e/(2M)|,  |/(a;)|  <  iV,  and  \g{x)  -  M\  <  e/{2N).  Then 

\f{x)g{x)  -  LM\  <  \f{x)\\g{x)     M\  +  \f{x)  -  L\\M\ 

<N—+  —  IMI 
2iV      2M  '  ' 

e  e 
=  2  +  2=^- 

This  is  just  what  we  needed,  so  by  the  official  definition,  lim  f{x)g{x)  =  LM.  m 
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A  handful  of  such  theorems  give  us  the  tools  to  compute  many  limits  without  explicitly- 
working  with  the  definition  of  limit. 

THEOREM  2.3.6  Suppose  that  lim  f(x)  =  L  and  lim  g(x)  =  M  and  k  is  some 
constant.  Then 

lim  kf{x)  =  k  lim  f{x)  =  kL 

lim  {f{x)  +  g{x))  =  lim  f{x)  +  lim  g{x)  =  L  +  M 

x—^a  x—^a  x—^a 

lim  {f{x)  —  g{x))  =  lim  f{x)  —  lim  g{x)  =  L  —  M 

X— >-a  X— >a  X— >a 

lim  (/(a;)5f(a;))  =  lim  f{x)  ■  lim  g{x)  —  LM 

x—^a  x—^a  x—^a 

f{x)      lim^^afix)  L 

lim  — —  =  =  TT)  if  -'W^  IS  not  0 

x^a  g{x)       hm^^ag{X)  M 


Roughly  speaking,  these  rules  say  that  to  compute  the  limit  of  an  algebraic  expression, 
it  is  enough  to  compute  the  limits  of  the  "innermost  bits"  and  then  combine  these  limits. 
This  often  means  that  it  is  possible  to  simply  plug  in  a  value  for  the  variable,  since 
lim  x  =  a. 

x—^a 

EXAMPLE  2.3.7    Compute  lim  .  If  we  apply  the  theorem  in  all  its  gory 

x^\      X  —  2 

detail,  we  get 

x^  —  2)X-\-b      limj;^.i(x^  —  3a;  +  5) 

lim  =  

x^i      x  —  2  lima;^i(x  — 2) 

_  (lim-g^i  x^)  -  (lim^^i  3a:)  +  (lima;^i  5) 
(lim^^i  a;)  -  (lim^^^i  2) 

_  (lim-e^i  xY  -  3(lim:r^i  x)-\-h 
(lim^^i  x)  -2 

12-3-1  +  5 


1-2 
1-3  +  5 


=  -3 


□ 


It  is  worth  commenting  on  the  trivial  limit  lim  5.  From  one  point  of  view  this  might 
seem  meaningless,  as  the  number  5  can't  "approach"  any  value,  since  it  is  simply  a  fixed 
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number.  But  5  can,  and  should,  be  interpreted  here  as  the  function  that  has  value  5 
everywhere,  f{x)  =  5,  with  graph  a  horizontal  line.  From  this  point  of  view  it  makes  sense 
to  ask  what  happens  to  the  height  of  the  function  as  x  approaches  1. 

Of  course,  as  we've  already  seen,  we're  primarily  interested  in  limits  that  aren't  so  easy, 
namely,  limits  in  which  a  denominator  approaches  zero.  There  are  a  handful  of  algebraic 
tricks  that  work  on  many  of  these  limits. 

+  2x  —  3 

EXAMPLE  2.3.8     Compute  lim  .  We  can't  simply  plug  in  x  =  1  because 

x^l       X  —  1 
that  makes  the  denominator  zero.  However: 

x2  +  2x-3      ,      (x-l)(x  +  3) 
lim    =  nm   

x^l        X  —  1  x^l  X  —  1 

=  lim  (x  +  3)  =  4 

□ 

While  theorem  2.3.6  is  very  helpful,  we  need  a  bit  more  to  work  easily  with  limits. 
Since  the  theorem  applies  when  some  limits  are  already  known,  we  need  to  know  the 
behavior  of  some  functions  that  cannot  themselves  be  constructed  from  the  simple  arith- 
metic operations  of  the  theorem,  such  as  ^/x.  Also,  there  is  one  other  extraordinarily 
useful  way  to  put  functions  together:  composition.  If  /(x)  and  g{x)  are  functions,  we  can 
form  two  functions  by  composition:  f{g{x))  and  g{f{x)).  For  example,  if  /(x)  =  a/x  and 

g{x)  =  x^  +  5,  then  f{g{x))  =  a/x^  +  5  and  g{f{x))  =  (y/x)'^  +  5  =  x  +  5.  Here  is  a 
companion  to  theorem  2.3.6  for  composition: 

THEOREM  2.3.9     Suppose  that  Um  g{x)  =  L  and  lim  /(x)  =  /(L).  Then 


L 


lim/(<7(x))  =  /(L). 


Note  the  special  form  of  the  condition  on  /:  it  is  not  enough  to  know  that  lim  /(x)  = 

x^L 

M,  though  it  is  a  bit  tricky  to  see  why.  Many  of  the  most  familiar  functions  do  have  this 
property,  and  this  theorem  can  therefore  be  applied.  For  example: 

THEOREM  2.3.10     Suppose  that  n  is  a  positive  integer.  Then 

lim  a/x  =  a/o. 


provided  that  a  is  positive  if  n  is  even. 
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This  theorem  is  not  too  difficult  to  prove  from  the  definition  of  limit. 
Another  of  the  most  common  algebraic  tricks  was  used  in  section  2.1.  Here's  another 
example: 

EXAMPLE  2.3.11     Compute  lim        ^  ^  ~  ^ 


-1      x  +  1 

Vx  +  5-2       ^.      ^/x+~5  -  2  ^/x+~5  +  2 
lim   =  lim  ^  

x^-l        X  +  1  X  +  1        ^/x  +  5  +  2 

X  +  5  -  4 

=  lim   ,   

^^-1  {x  +  l){y/x  +  5  +  2) 

x  +  1 

=  lim   ,   

^^-1  {x  +  l){^/x  +  5  +  2) 

=  lim     ,    =  \ 

^^-1  y/x  +  5  +  2  4 

At  the  very  last  step  we  have  used  theorems  2.3.9  and  2.3.10.  □ 

Occasionally  we  will  need  a  slightly  modified  version  of  the  limit  definition.  Consider 
the  function  /(x)  =  vT^-a?^,  the  upper  half  of  the  unit  circle.  What  can  we  say  about 
lim  f{x)7  It  is  apparent  from  the  graph  of  this  familiar  function  that  as  x  gets  close  to  1 

x^l 

from  the  left,  the  value  of  /(x)  gets  close  to  zero.  It  does  not  even  make  sense  to  ask  what 
happens  as  x  approaches  1  from  the  right,  since  /(x)  is  not  defined  there.  The  definition 
of  the  limit,  however,  demands  that  /(I  +  Ax)  be  close  to  /(I)  whether  Ax  is  positive  or 
negative.  Sometimes  the  limit  of  a  function  exists  from  one  side  or  the  other  (or  both)  even 
though  the  limit  does  not  exist.  Since  it  is  useful  to  be  able  to  talk  about  this  situation, 
we  introduce  the  concept  of  one  sided  limit: 

DEFINITION  2.3.12     One-sided  limit      Suppose  that  /(x)  is  a  function.  We  say 

that   lim  /(x)  =  L  if  for  every  e  >  0  there  is  a  5  >  0  so  that  whenever  0  <  a  —  x  <  5, 

|/(x)  —  L|  <  e.  We  say  that  lim^^a+  f{x)  =  L  if  for  every  e  >  0  there  is  a  5  >  0  so  that 
whenever  0  <  x  —  a  <  5,  |/(x)  —  L|  <  e.  □ 

Usually  lim  /(x)  is  read  "the  limit  of  /(x)  from  the  left"  and  lim  /(x)  is  read  "the 
limit  of  /(x)  from  the  right". 

EXAMPLE  2.3.13     Discuss  lim  — ,  lim  — ,  and  lim  — . 

x^-O  |x|    x^-O-  |x|  x^.0+  |x| 

The  function  /(x)  =  x/|x|  is  undefined  at  0;  when  x  >  0,  |x|  =  x  and  so  /(x)  =  1; 
when  X  <  0,  |x|  =  — x  and  /(x)  =  —1.  Thus  lim  - — -  =  lim  —1  =  —1  while  lim  - — -  = 

x^-O-    X         x^-O-  x^.0+  X 
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lim  1  =  1.  The  limit  of  /(x)  must  be  equal  to  both  the  left  and  right  limits;  since  they 

X 

are  different,  the  limit  lim  - — -  does  not  exist.  □ 

x^O  \x\ 

Exercises  2.3. 

Compute  the  limits.  If  a  limit  does  not  exist,  explain  why. 
1.    lim    =^  2.  lim   

x^Z         X  —  6  x^l         X  —  6 

„            x^  +  x-l2  ^  x^  +  x-l2 

3.     hm   =^  4.  hm 


x^-A      a;  —  3  x^2  X 


lim  ^""^  ~^  ^ — ^  6.     lim  ' 


x^l       X  —  1  x^0+  \  X  \  X 

7.    lim  3  8.    lim  3x^  —  5x 

X^2  33— >-4 

,.     4x  —  ^„     ,.         —  1 

9.    nm    10.  nm 


x^O      X  —  1  x^l   X  —  1 

V2-x^  v 


11.     lim    =^  12.  lim 


0+  X  x^0+      X  +  1 

2    ,  ,\3 


X^  "3 


13.    lim    =^  14.    lim(a;  +4) 

x^a     X  —  a  x^2 

15.  lim  (^~^       /  1>  ^ 
33^1  I  7         X  =  1. 

16.  lim  a;  sin  (  —  )  (Hint:   Use  the  fact  that  |sina|  <  1  for  any  real  number  a.    You  should 

probably  use  the  definition  of  a  limit  here.)  =^ 

17.  Give  an  e-S  proof,  similar  to  example  2.3.3,  of  the  fact  that  lim  (2a;  —  5)  =  3. 
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18.  Evaluate  the  expressions  by  reference  to  this  graph: 


—to-i 

-t  

/ 

J 

 1 

1  

 8- 

— 6- 

 1 

)  

— 4  - 

 3- 

4 

2  \ 

0 

L 

e 

v. 

» 

(a)  lim  /(x) 
(d)    lim  /(x) 
(g)    lim  fix) 

(j)  /(o) 


(b)    lim  /(x) 

(e)  hm^/(a;) 

(h)''lim  fix) 

(k)   lim  fix -A) 
K— >-i~ 


19.  Use  a  calculator  to  estimate  lim 


smx 


20.  Use  a  calculator  to  estimate  lim 


x^O  X 

tan(3x) 


x^o  tan(5a;) 


(c)  lim  /(a;) 

(f)  /(-2) 

(i)  lim  /(x  +  l) 

a:— )-0 

(1)    lim  /(x-2) 

x^0+ 


2.4    The  Derivative  Function 

We  have  seen  how  to  create,  or  derive,  a  new  function  f'{x)  from  a  function  /(x),  and 
that  this  new  function  carries  important  information.  In  one  example  we  saw  that  f'{x) 
tells  us  how  steep  the  graph  of  /(x)  is;  in  another  we  saw  that  /'(x)  tells  us  the  velocity 
of  an  object  if  /(x)  tells  us  the  position  of  the  object  at  time  x.  As  we  said  earlier,  this 
same  mathematical  idea  is  useful  whenever  /(x)  represents  some  changing  quantity  and  we 
want  to  know  something  about  how  it  changes,  or  roughly,  the  "rate"  at  which  it  changes. 
Most  functions  encountered  in  practice  are  built  up  from  a  small  collection  of  "primitive" 
functions  in  a  few  simple  ways,  for  example,  by  adding  or  multiplying  functions  together 
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to  get  new,  more  complicated  functions.  To  make  good  use  of  the  information  provided 
by  f'{x)  we  need  to  be  able  to  compute  it  for  a  variety  of  such  functions. 

We  will  begin  to  use  different  notations  for  the  derivative  of  a  function.  While  initially 
confusing,  each  is  often  useful  so  it  is  worth  maintaining  multiple  versions  of  the  same 
thing. 

Consider  again  the  function  f{x)  =  -\/ 625  —  x^.  We  have  computed  the  derivative 
f'{x)  =  —xj\l 625  —  x^,  and  have  already  noted  that  if  we  use  the  alternate  notation 
y  =  625  —  a;^  then  we  might  write  y'  =  —xj  625  —  x"^.  Another  notation  is  quite 
different,  and  in  time  it  will  become  clear  why  it  is  often  a  useful  one.  Recall  that  to 
compute  the  the  derivative  of  /  we  computed 

a/625  -  (7  +  Ax)2  -  24 
lim  —  . 

The  denominator  here  measures  a  distance  in  the  x  direction,  sometimes  called  the  "run" , 
and  the  numerator  measures  a  distance  in  the  y  direction,  sometimes  called  the  "rise,"  and 
"rise  over  run"  is  the  slope  of  a  line.  Recall  that  sometimes  such  a  numerator  is  abbreviated 
Aj/,  exchanging  brevity  for  a  more  detailed  expression.  So  in  general,  a  derivative  is  given 

by 

y  =  lim  - — . 

A2;^0  Ax 

To  recall  the  form  of  the  limit,  we  sometimes  say  instead  that 

dy  Ay 

—  =  hm  — . 
ax     Ax->-o  Ax 

In  other  words,  dy/dx  is  another  notation  for  the  derivative,  and  it  reminds  us  that  it  is 
related  to  an  actual  slope  between  two  points.  This  notation  is  called  Leibniz  notation, 
after  Gottfried  Leibniz,  who  developed  the  fundamentals  of  calculus  independently,  at 
about  the  same  time  that  Isaac  Newton  did.  Again,  since  we  often  use  /  and  f{x)  to  mean 
the  original  function,  we  sometimes  use  df  / dx  and  df{x)/dx  to  refer  to  the  derivative.  If 
the  function  f{x)  is  written  out  in  full  we  often  write  the  last  of  these  something  like  this 

d 


f'(x)  =  —  V625  -x2 

iJjtjb 

with  the  function  written  to  the  side,  instead  of  trying  to  fit  it  into  the  numerator. 


EXAMPLE  2.4.1    Find  the  derivative  of  y  =  f{t)  =  t^. 
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We  compute 

At^O  At  At 

t^  +  2tAt  +  At^  -  t^ 

=  lim  

At^O  At 

2tAt  +  At^ 
=  lim  

=  lim  2t  +  At  =  2t. 

At^O 

Remember  that  At  is  a  single  quantity,  not  a  "A"  times  a  "f,  and  so  At^  is  (At)^  not 
A{t^).  □ 

EXAMPLE  2.4.2    Find  the  derivative  of  y  =  f{x)  =  1/x. 
The  computation: 


y'  =   lim  — ^  =  lim   ^+^^  5- 

Aa;^>0  Ax       Ax^O  Ax 

X    x+Ax 

  lijji    ^(s^+^s;)  x{x+Ax) 

Ax^O  Ax 
x—{x+Ax) 

=  hm  "^"+^"^ 


Ax^O  Ax 

X  —  X  —  Ax 

hm  —  — — — 

Ax^o  x{x  +  AxjAa; 

-Ax 

hm 


Aa;^o  x{x  +  Aa;)Aa; 
lim 


Aa;^>0  x{x  +  Ax)  x'^ 


□ 


Note.  If  you  happen  to  know  some  "derivative  formulas"  from  an  earlier  course,  for 
the  time  being  you  should  pretend  that  you  do  not  know  them.  In  examples  like  the 
ones  above  and  the  exercises  below,  you  are  required  to  know  how  to  find  the  derivative 
formula  starting  from  basic  principles.  We  will  later  develop  some  formulas  so  that  we  do 
not  always  need  to  do  such  computations,  but  we  will  continue  to  need  to  know  how  to  do 
the  more  involved  computations. 

Sometimes  one  encounters  a  point  in  the  domain  of  a  function  y  =  f{x)  where  there 
is  no  derivative,  because  there  is  no  tangent  line.  In  order  for  the  notion  of  the  tangent 
line  at  a  point  to  make  sense,  the  curve  must  be  "smooth"  at  that  point.  This  means  that 
if  you  imagine  a  particle  traveling  at  some  steady  speed  along  the  curve,  then  the  particle 
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does  not  experience  an  abrupt  change  of  direction.  There  are  two  types  of  situations  you 
should  be  aware  of — corners  and  cusps — where  there's  a  sudden  change  of  direction  and 
hence  no  derivative. 

EXAMPLE  2.4.3    Discuss  the  derivative  of  the  absolute  value  function  y  =  f{x)  =  \x\. 

If  X  is  positive,  then  this  is  the  function  y  =  x,  whose  derivative  is  the  constant  1. 
(Recall  that  when  y  =  f{x)  =  mx  +  6,  the  derivative  is  the  slope  m.)  If  x  is  negative,  then 
we're  dealing  with  the  function  y  =  —x,  whose  derivative  is  the  constant  —1.  If  x  =  0, 
then  the  function  has  a  corner,  i.e.,  there  is  no  tangent  line.  A  tangent  line  would  have 
to  point  in  the  direction  of  the  curve — but  there  are  two  directions  of  the  curve  that  come 
together  at  the  origin.  We  can  summarize  this  as 

r  1  if  X  >  0; 

y'  =  <  -1  if  X  <  0; 

I  undefined    if  x  =  0. 

□ 


EXAMPLE  2.4.4 

Discuss  the  derivative  of  the  function  y  =  x^^'^,  shown  in  figure  2.4.1.  We  will  later  see 
how  to  compute  this  derivative;  for  now  we  use  the  fact  that  y'  =  {2/3)x~^^^ .  Visually  this 
looks  much  like  the  absolute  value  function,  but  it  technically  has  a  cusp,  not  a  corner.  The 
absolute  value  function  has  no  tangent  line  at  0  because  there  are  (at  least)  two  obvious 
contenders — the  tangent  line  of  the  left  side  of  the  curve  and  the  tangent  line  of  the  right 
side.  The  function  y  =  x^l"^  does  not  have  a  tangent  line  at  0,  but  unlike  the  absolute  value 
function  it  can  be  said  to  have  a  single  direction:  as  we  approach  0  from  either  side  the 
tangent  line  becomes  closer  and  closer  to  a  vertical  line;  the  curve  is  vertical  at  0.  But  as 
before,  if  you  imagine  traveling  along  the  curve,  an  abrupt  change  in  direction  is  required 
at  0:  a  full  180  degree  turn.  □ 


Figure  2.4.1     A  cusp  on  a;^ 
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In  practice  we  won't  worry  much  about  the  distinction  between  these  examples;  in  both 
cases  the  function  has  a  "sharp  point"  where  there  is  no  tangent  hne  and  no  derivative. 

Exercises  2.4- 

1.  Find  the  derivative  of  y  =  f{x)  =  \J  169  —     .  =f- 

2.  Find  the  derivative  of  y  =  f{t)  =  80  -  4.9t^.  ^ 

3.  Find  the  derivative  of  y  =  /(x)  =  i?  —  {Xjx). 

4.  Find  the  derivative  of  y  =  fix)  =  ax'^  +  bx  +  c  (where  a,  b,  and  c  are  constants).  =^ 

5.  Find  the  derivative  of  y  =  f{x)  =  x"^.  =^> 

6.  Shown  is  the  graph  of  a  function  f{x).  Sketch  the  graph  of  f'{x)  by  estimating  the  derivative 
at  a  number  of  points  in  the  intervah  estimate  the  derivative  at  regular  intervals  from  one 
end  of  the  interval  to  the  other,  and  also  at  "special"  points,  as  when  the  derivative  is  zero. 
Make  sure  you  indicate  any  places  where  the  derivative  does  not  exist. 

1.6 

1.4 
1.2 
1.0 
0.8 
0.6 
0.4 
0.2 


-1.0  -0.8  -0.6  -0.4  -0.2    0.0     0.2     0.4     0.6      0.8  1.0 

7.  Shown  is  the  graph  of  a  function  f{x).  Sketch  the  graph  of  f'{x)  by  estimating  the  derivative 
at  a  number  of  points  in  the  interval:  estimate  the  derivative  at  regular  intervals  from  one 
end  of  the  interval  to  the  other,  and  also  at  "special"  points,  as  when  the  derivative  is  zero. 
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Make  sure  you  indicate  any  places  where  the  derivative  does  not  exist. 


0  1  2  3  4  5 

8.  Find  the  derivative  of  y  =  f{x)  =  2/\/2x  +  1  =^ 

9.  Find  the  derivative  of  y  =  g{t)  =  {2t  -  l)/{t  +  2)  ^ 

10.  Find  an  equation  for  the  tangent  Hne  to  the  graph  of  f{x)  =  5  —  x  —  3x^  at  the  point  x  =  2 

=^ 

11.  Find  a  value  for  a  so  that  the  graph  of  f{x)  =  x'^  +  ax  —  3  has  a  horizontal  tangent  line  at 
X  =  4.  =J> 

2.5    Adjectives  For  Functions 

As  we  have  defined  it  in  Section  1.3,  a  function  is  a  very  general  object.  At  this  point,  it 
is  useful  to  introduce  a  collection  of  adjectives  to  describe  certain  kinds  of  functions;  these 
adjectives  name  useful  properties  that  functions  may  have.  Consider  the  graphs  of  the 
functions  in  Figure  2.5.1.  It  would  clearly  be  useful  to  have  words  to  help  us  describe  the 
distinct  features  of  each  of  them.  We  will  point  out  and  define  a  few  adjectives  (there  are 
many  more)  for  the  functions  pictured  here.  For  the  sake  of  the  discussion,  we  will  assume 
that  the  graphs  do  not  exhibit  any  unusual  behavior  off-stage  (i.e.,  outside  the  view  of  the 
graphs). 

Functions.  Each  graph  in  Figure  2.5.1  certainly  represents  a  function — since  each  passes 
the  vertical  line  test.  In  other  words,  as  you  sweep  a  vertical  line  across  the  graph  of  each 
function,  the  line  never  intersects  the  graph  more  than  once.  If  it  did,  then  the  graph 
would  not  represent  a  function. 

Bounded.  The  graph  in  (c)  appears  to  approach  zero  as  x  goes  to  both  positive  and 
negative  infinity.  It  also  never  exceeds  the  value  1  or  drops  below  the  value  0.  Because  the 


54     Chapter  2  Instantaneous  Rate  of  Change:  The  Derivative 


(c) 


-2 


Figure  2.5.1     Function  Types:  (a)  a  discontinuous  function,  (b)  a  continuous  function,  (c) 
a  bounded,  differentiable  function,  (d)  an  unbounded,  differentiable  function 

graph  never  increases  or  decreases  without  bound,  we  say  that  the  function  represented  by 
the  graph  in  (c)  is  a  bounded  function. 


DEFINITION  2.5.1  Bounded  A  function  /  is  bounded  if  there  is  a  number  M 
such  that  \f{x)\  <  M  for  every  x  in  the  domain  of  /.  n 

For  the  function  in  (c),  one  such  choice  for  M  would  be  10.  However,  the  smallest 
(optimal)  choice  would  be  M  =  1.  In  either  case,  simply  finding  an  M  is  enough  to 
establish  boundedness.  No  such  M  exists  for  the  hyperbola  in  (d)  and  hence  we  can  say 
that  it  is  unbounded. 

Continuity.  The  graphs  shown  in  (b)  and  (c)  both  represent  continuous  functions. 
Geometrically,  this  is  because  there  are  no  jumps  in  the  graphs.  That  is,  if  you  pick  a 
point  on  the  graph  and  approach  it  from  the  left  and  right,  the  values  of  the  function 
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approach  the  value  of  the  function  at  that  point.  For  example,  we  can  see  that  this  is  not 
true  for  function  values  near  x  =  —1  on  the  graph  in  (a)  which  is  not  continuous  at  that 
location. 

DEFINITION  2.5.2  Continuous  at  a  Point  A  function  /  is  continuous  at  a  point 
a  if  lim  f{x)  —  f{a).  □ 

x—>-a 

DEFINITION  2.5.3  Continuous  A  function  /  is  continuous  if  it  is  continuous  at 
every  point  in  its  domain.  □ 

Strangely,  we  can  also  say  that  (d)  is  continuous  even  though  there  is  a  vertical  asymp- 
tote. A  careful  reading  of  the  definition  of  continuous  reveals  the  phrase  "at  every  point 
in  its  domain."  Because  the  location  of  the  asymptote,  x  =  0,  is  not  in  the  domain  of 
the  function,  and  because  the  rest  of  the  function  is  well-behaved,  we  can  say  that  (d)  is 
continuous. 

Differentiability.  Now  that  we  have  introduced  the  derivative  of  a  function  at  a  point, 
we  can  begin  to  use  the  adjective  differentiable.  We  can  see  that  the  tangent  line  is  well- 
defined  at  every  point  on  the  graph  in  (c).  Therefore,  we  can  say  that  (c)  is  a  differentiable 
function. 

DEFINITION  2.5.4  Different iable  at  a  Point  A  function  /  is  differentiable  at 
point  a  if  /'(a)  exists.  □ 

DEFINITION  2.5.5  Differentiable  A  function  /  is  differentiable  if  is  differentiable 
at  every  point  (excluding  endpoints  and  isolated  points  in  the  domain  of  /)  in  the  domain 
of/.  □ 

Take  note  that,  for  technical  reasons  not  discussed  here,  both  of  these  definitions 
exclude  endpoints  and  isolated  points  in  the  domain  from  consideration. 

We  now  have  a  collection  of  adjectives  to  describe  the  very  rich  and  complex  set  of 
objects  known  as  functions. 

We  close  with  a  useful  theorem  about  continuous  functions: 

THEOREM  2.5.6  Intermediate  Value  Theorem  If  /  is  continuous  on  the  interval 
[a,  b]  and  d  is  between  /(a)  and  f{b),  then  there  is  a  number  c  in  [a,  b]  such  that  /(c)  =  d. 


This  is  most  frequently  used  when  d  —  0. 
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EXAMPLE  2.5.7  Explain  why  the  function  f  =  +  3x^  +  x  —  2  has  a  root  between 
0  and  1. 

By  theorem  2.3.6,  /  is  continuous.  Since  /(O)  =  —2  and  /(I)  =  3,  and  0  is  between 


This  example  also  points  the  way  to  a  simple  method  for  approximating  roots. 
EXAMPLE  2.5.8    Approximate  the  root  of  the  previous  example  to  one  decimal  place. 


If  we  compute  /(O.l),  /(0.2),  and  so  on,  we  find  that  /(0.6)  <  0  and  /(0.7)  >  0,  so  by 
the  Intermediate  Value  Theorem,  /  has  a  root  between  0.6  and  0.7.  Repeating  the  process 
with  /(0.61),  /(0.62),  and  so  on,  we  find  that  /(0.61)  <  0  and  /(0.62)  >  0,  so  /  has  a  root 


Exercises  2.5. 

1.  Along  the  lines  of  Figure  2.5.1,  for  each  part  below  sketch  the  graph  of  a  function  that  is: 

a.  bounded,  but  not  continuous. 

b.  differentiable  and  unbounded. 

c.  continuous  at  a;  =  0,  not  continuous  at  a;  =  1,  and  bounded. 

d.  differentiable  everywhere  except  at  x  =  —1,  continuous,  and  unbounded. 

2.  Is  f{x)  =  sin(x)  a  bounded  function?  If  so,  find  the  smallest  M. 

3.  Is  s{t)  =         +t^)  a  bounded  function?  If  so,  find  the  smallest  M. 

4.  Is  v{u)  =  21n|«|  a  bounded  function?  If  so,  find  the  smallest  M. 

5.  Consider  the  function 


2  and  3,  there  is  a  c  G  [0, 1]  such  that  /(c)  =  0. 


□ 


between  0.61  and  0.62,  and  the  root  is  0.6  rounded  to  one  decimal  place. 


□ 


6. 


7. 


Show  that  it  is  continuous  at  the  point  a;  =  0.  Is  ft.  a  continuous  function? 
Approximate  a  root  of  /  =      —  4x'^  +  23;  +  2  to  one  decimal  place. 
Approximate  a  root  oi  f  =  x'^  +      —  bx  +  1  to  one  decimal  place. 


3 

Rules  for  Finding  Derivatives 


It  is  tedious  to  compute  a  limit  every  time  we  need  to  know  the  derivative  of  a  function. 
Fortunately,  we  can  develop  a  small  collection  of  examples  and  rules  that  allow  us  to 
compute  the  derivative  of  almost  any  function  we  are  likely  to  encounter.  Many  functions 
involve  quantities  raised  to  a  constant  power,  such  as  polynomials  and  more  complicated 
combinations  like  y  =  (sinx)^.  So  we  start  by  examining  powers  of  a  single  variable;  this 
gives  us  a  building  block  for  more  complicated  examples. 


We  start  with  the  derivative  of  a  power  function,  f{x)  =  .  Here  n  is  a  number  of 
any  kind:  integer,  rational,  positive,  negative,  even  irrational,  as  in  x'^ .  We  have  already 
computed  some  simple  examples,  so  the  formula  should  not  be  a  complete  surprise: 


It  is  not  easy  to  show  this  is  true  for  any  n.  We  will  do  some  of  the  easier  cases  now,  and 
discuss  the  rest  later. 

The  easiest,  and  most  common,  is  the  case  that  n  is  a  positive  integer.  To  compute 
the  derivative  we  need  to  compute  the  following  limit: 


For  a  specific,  fairly  small  value  of  n,  we  could  do  this  by  straightforward  algebra. 


3.1    The  Power  Rule 


d 
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EXAMPLE  3.1.1    Find  the  derivative  of  f{x)  =  x^. 

do  (x  +  Ax)^  -  x^ 

—x'^  =  lim  ^  . 

dx        Ax->-o  Ax 

x^ +  3x'^Ax +  3xAx'^  +  Ax^  -  x^ 
=  lim   T  . 

Ax^O  Ax 

Sx'^Ax  +  SxAx'^  +  Ax^ 
=  lim   T  . 

Ax^O  Ax 

=  lim  3x'^ +  3xAx  +  Ax'^  =  3x'^. 

Ax^O 

□ 

The  general  case  is  really  not  much  harder  as  long  as  we  don't  try  to  do  too  much. 
The  key  is  understanding  what  happens  when  {x  +  Aa;)"  is  multiplied  out: 

{x  +  Ax)'^  =      +  nx^-'^Ax  +  a2X^~'^ Ax^  ^  V  ^an-\xAx^~^  +  Ax". 

We  know  that  multiplying  out  will  give  a  large  number  of  terms  all  of  the  form  x^  Ax^  ^  and 
in  fact  that  j  =  n'm.  every  term.  One  way  to  see  this  is  to  understand  that  one  method 
for  multiplying  out  [x  +  Ax)"^  is  the  following:  In  every  [x  +  Ax)  factor,  pick  either  the  x 
or  the  Ax,  then  multiply  the  n  choices  together;  do  this  in  all  possible  ways.  For  example, 
for  {x  +  Ax)^,  there  are  eight  possible  ways  to  do  this: 

[x  +  Ax^{x  +  Ax^{x  +  Ax)  =  XXX  +  xxAx  +  xAxx  +  xAxAx 

+  Axxx  +  AxxAx  +  AxAxx  +  Ax  Ax  Ax 
=     +  x^Ax  +  x^Ax  +  xAx^ 

+  x^Ax  +  xAx^  +  xAx^  +  Ax^ 
=  x^  +  3x^  Ax  +  3xAx^  +  Ax^ 

No  matter  what  n  is,  there  are  n  ways  to  pick  Ax  in  one  factor  and  x  in  the  remaining 
n  —  1  factors;  this  means  one  term  is  nx"^""*^  Ax.  The  other  coefficients  are  somewhat  harder 
to  understand,  but  we  don't  really  need  them,  so  in  the  formula  above  they  have  simply 
been  called  a2,  03,  and  so  on.  We  know  that  every  one  of  these  terms  contains  Ax  to  at 
least  the  power  2.  Now  let's  look  at  the  limit: 

cZ   „      ,       (x  +  Ax)"  -  x" 
-x"  =  hm  ' 


dx  Aa;-)-0  Ax 

x"  +  nx"~^  Ax  +  a2x"-^Ax^  H  V  a„_ixAx"-^  +  Ax' 


=  lim 


Aa;-)-0  Ax 

nx"-^  Ax  +  a2x"~2  Ax^  H  V  an-ixAx^~^  +  Ax" 

lim  

Aa;-)-0  Ax 

lim  nx""-^  +  a2x"~^Ax  H  V  a„_ixAx"~^  +  Ax""-^  =  nx"~^ 

Ax^Q 
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Now  without  much  trouble  we  can  verify  the  formula  for  negative  integers.  First  let's 
look  at  an  example: 


EXAMPLE  3.1.2  Find  the  derivative  of  y  =  x  ^.  Using  the  formula,  y'  =  -3x  ^  ^  = 
-3x-*.  □ 

Here  is  the  general  computation.  Suppose  n  is  a  negative  integer;  the  algebra  is  easier 
to  follow  if  we  use  n  =  —m  in  the  computation,  where  m  is  a  positive  integer. 

d    „      d    _^  (x  +  Ax) 

—x   =  —x      =  lim  ^  — 

dx         dx  Ax^o  Ax 

1  1 


nm 


-(x  +  Ax) 


lim  - —    ^   ,  ^ 
Ax^o  (x  +  Ax)'^x"^Ax 

_  /^m  ^  mx'^-^Ax  +  a2x"'-^Ax^  H  +  a^-ixAx"^-^  +  Ax"") 

1™   7  X  ^  X  

Ax^o  (x  +  Ax)""x""Ax 

-mx'^~^  —  a2X™'~^Ax  —  ...  —  am-i^Ax"""^  —  Ax""""*^) 


=       1™  /  A  N 

Ax^o  (x  +  Ax)""x"" 

^  _jnx         ^  _nix         ^  _^^^_i_2^  ^  nx-^-^  =  nx^-\ 

We  will  later  see  why  the  other  cases  of  the  power  rule  work,  but  from  now  on  we  will 
use  the  power  rule  whenever  n  is  any  real  number.  Let's  note  here  a  simple  case  in  which 
the  power  rule  applies,  or  almost  applies,  but  is  not  really  needed.  Suppose  that  /(x)  =  1; 
remember  that  this  "1"  is  a  function,  not  "merely"  a  number,  and  that  /(x)  =  1  has  a 
graph  that  is  a  horizontal  line,  with  slope  zero  everywhere.  So  we  know  that  /'(x)  =  0. 
We  might  also  write  /(x)  =  x°,  though  there  is  some  question  about  just  what  this  means 
at  X  =  0.  If  we  apply  the  power  rule,  we  get  /'(x)  =  Ox~^  =  0/x  =  0,  again  noting  that 
there  is  a  problem  at  x  =  0.  So  the  power  rule  "works"  in  this  case,  but  it's  really  best  to 
just  remember  that  the  derivative  of  any  constant  function  is  zero. 

Exercises  3.1. 

Find  the  derivatives  of  the  given  functions. 

1  100  ^  r,  -100  ^ 

1.    X      =^  2.    a;  =^ 

3.    ^  ^  4.    a;"  ^ 

5.   x^/^^  6.   x-^^'  ^ 
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3.2    Linearity  of  the  Derivative 

An  operation  is  linear  if  it  behaves  "nicely"  with  respect  to  multiplication  by  a  constant 
and  addition.  The  name  comes  from  the  equation  of  a  line  through  the  origin,  f{x)  =  mx, 
and  the  following  two  properties  of  this  equation.  First,  f{cx)  —  m{cx)  —  c{mx)  —  cf{x), 
so  the  constant  c  can  be  "moved  outside"  or  "moved  through"  the  function  /.  Second, 
f{x  +  y)  =  m{x  +  y)  =  mx  +  my  =  f{x)  +  f{y),  so  the  addition  symbol  likewise  can  be 
moved  through  the  function. 

The  corresponding  properties  for  the  derivative  are: 

and 

{f{x)+g{x)y  =  ^im+9i^))  =  ^/(^)  +  ^9i^)  =  f{^)+9'{x). 

It  is  easy  to  see,  or  at  least  to  believe,  that  these  are  true  by  thinking  of  the  dis- 
tance/speed interpretation  of  derivatives.  If  one  object  is  at  position  f{t)  at  time  t,  we 
know  its  speed  is  given  by  f'{t).  Suppose  another  object  is  at  position  5/(t)  at  time  t, 
namely,  that  it  is  always  5  times  as  far  along  the  route  as  the  first  object.  Then  it  "must" 
be  going  5  times  as  fast  at  all  times. 

The  second  rule  is  somewhat  more  complicated,  but  here  is  one  way  to  picture  it. 
Suppose  a  flat  bed  railroad  car  is  at  position  f{t)  at  time  t,  so  the  car  is  traveling  at  a 
speed  of  fit)  (to  be  specific,  let's  say  that  fit)  gives  the  position  on  the  track  of  the  rear 
end  of  the  car).  Suppose  that  an  ant  is  crawling  from  the  back  of  the  car  to  the  front  so 
that  its  position  on  the  car  is  g{t)  and  its  speed  relative  to  the  car  is  g'it).  Then  in  reality, 
at  time  t,  the  ant  is  at  position  f{t)  +  g{t)  along  the  track,  and  its  speed  is  "obviously" 

nt)+9'{t). 

We  don't  want  to  rely  on  some  more-or-less  obvious  physical  interpretation  to  deter- 
mine what  is  true  mathematically,  so  let's  see  how  to  verify  these  rules  by  computation. 
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We'll  do  one  and  leave  the  other  for  the  exercises. 

-^{fix)+  g{x))  =  lim 
ax  Ax->-o 


lim 


fix 

+  Ax) 

+  gix 

+  Ax) 

-  ifi< 

)  +  gix)) 

Ax 

fix 

+  Ax) 

+  gix 

+  Ax) 

-fi^) 

-gix) 

Ax 

fix 

+  Ax) 

-fix 

)+gix 

+  Ax) 

-  gix) 

lim  ^ 

Ax^O  Ax 

^  ^.^  ,  fix  +  ^x)  -  fix)  ^  gix  +  Ax) -gix) 


Ax^O  \  Ax  Ax 

^  ^.^  fix  +  Ax)  -  fix)  ^  ^.^  g{x  +  Ax)  -  gix) 

Ax-^0  Ax  Ax^O  Ax 

=  f'ix)+g'ix) 
This  is  sometimes  called  the  sum  rule  for  derivatives. 

EXAMPLE  3.2.1  Find  the  derivative  of  /(x)  =  x^  +  5x^.  We  have  to  invoke  linearity 
twice  here: 

f'(x)  =  -^(x^  +  5x2)  ^  d^5  ^  _l(5a;2^)  ^  5^4  ^  5-^(x'^)  =  5a;4  +  5  .  2x^  =  5x^  +  lOx. 
ax  ax        ax  ax 

□ 

Because  it  is  so  easy  with  a  little  practice,  we  can  usually  combine  all  uses  of  linearity 
into  a  single  step.  The  following  example  shows  an  acceptably  detailed  computation. 

EXAMPLE  3.2.2    Find  the  derivative  of  /(x)  =  3/x^  -  2x^  +  6x  -  7. 

f(x)  =       (  ^  -  2x2  +  6x  -  7  )  =  -^(3x-^  -  2x2  +  6x  -  7)  =  -12x-^  -  4x  +  6. 
ax  \x*  /  ax 


□ 


Exercises  3.2. 

Find  the  derivatives  of  the  functions  in  1-6. 

1.  +  I2x^  -  15  ^ 

2.  -Ax^  +  ?,x^  -hjx^  =^ 

3.  5(-3x^  +  5x  +  l)  ^ 

4.  f{x)  +  g{x),  where  f{x)  =  x'^  —  3x  +  2  and  g{x)  =  2x^  —  5x 

5.  (x  +  l)(x^  +  2x-3) 

6.  \/625  -  x2  +  3x^  +  12  (See  section  2.1.)  ^ 

7.  Find  an  equation  for  the  tangent  Hne  to  f{x)  =  x^ /i  —  1/a;  at  a;  =  —2. 
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8.  Find  an  equation  for  the  tangent  line  to  f{x)  =        —  tt^  at  x  =  4.  =^ 

9.  Suppose  the  position  of  an  object  at  time  t  is  given  by  f{t)  =  — 49t^/10  +  5t  +  10.  Find  a 
function  giving  the  speed  of  the  object  at  time  t.  The  acceleration  of  an  object  is  the  rate  at 
which  its  speed  is  changing,  which  means  it  is  given  by  the  derivative  of  the  speed  function. 
Find  the  acceleration  of  the  object  at  time  t.  =^ 

10.  Let  f{x)  =      and  c  =  3.  Sketch  the  graphs  of  /,  cf,  f ,  and  (c/)'  on  the  same  diagram. 

n 

11.  The  general  polynomial  P  of  degree  n  in  the  variable  x  has  the  form  P{x)  =        akX^  = 

k  =  0 

ao  +  aix  +  . . .  +  UnX^.  What  is  the  derivative  (with  respect  to  x)  of  P?  =^ 

12.  Find  a  cubic  polynomial  whose  graph  has  horizontal  tangents  at  (—2,  5)  and  (2,  3).  =^ 

13.  Prove  that  -^{cf{x))  =  cf'{x)  using  the  definition  of  the  derivative. 

14.  Suppose  that  /  and  g  are  differentiable  at  x.  Show  that  f  —  g  is  differentiable  at  x  using  the 
two  linearity  properties  from  this  section. 


3.3    The  Product  Rule 

Consider  the  product  of  two  simple  functions,  say  f{x)  =  (x^  +  ~  3a;).  An  obvious 

guess  for  the  derivative  of  /  is  the  product  of  the  derivatives  of  the  constituent  functions: 
(2a;)  (3x^  —  3)  =  6x^  —  6x.  Is  this  correct?  We  can  easily  check,  by  rewriting  /  and  doing  the 
calculation  in  a  way  that  is  known  to  work.  First,  f{x)  =  x^ —  3x^ +x^ —  3x  =  x^  —  2x^  —  3x, 
and  then  f'{x)  =  —  6x^  —  3.  Not  even  close!  What  went  "wrong"?  Well,  nothing  really, 
except  the  guess  was  wrong. 

So  the  derivative  of  f{x)g{x)  is  NOT  as  simple  as  f'{x)g'{x).  Surely  there  is  some 
rule  for  such  a  situation?  There  is,  and  it  is  instructive  to  "discover"  it  by  trying  to  do 
the  general  calculation  even  without  knowing  the  answer  in  advance. 

d  f{x  +  Ax)g{x  +  Ax)  -  f{x)g{x) 

--{f{x)g{x))  =  hm  

ax  Ax^o  Ax 


y      /(x  +  Ax)g{x  +  Ax)  —  /(x  +  Ax)g{x)  +  /(x  +  Ax)g{x)  —  f{x)g{x) 

Ax-^0  Ax 

^  ^.^  /(x  +  Ax)g{x  +  Ax)  -  /(x  +  Ax)g{x)  _^  ^.^  /(x  +  Ax)fir(x)  -  f{x)g{x] 

Ax^.0  Ax  Ax^O  Ax 

r            1  A  ^9{x  +  Ax)  -  g{x)             /(x  +  Ax)  - /(x) 
=  hm  r  X  +  Ax  h  nm  gr  x) 

Ax^O''^  '  Ax  Ax^O  Ax  ' 

=  f{x)g'{x)  +  f{x)g{x) 


A  couple  of  items  here  need  discussion.  First,  we  used  a  standard  trick,  "add  and  subtract 
the  same  thing" ,  to  transform  what  we  had  into  a  more  useful  form.  After  some  rewriting, 
we  realize  that  we  have  two  limits  that  produce  /'(x)  and  g'{x).  Of  course,  /'(x)  and 
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g'{x)  must  actually  exist  for  this  to  make  sense.  We  also  replaced   lim  f{x  +  Ax)  with 

f{x) — why  is  this  justified? 

What  we  really  need  to  know  here  is  that  lim  /(x  +  Ax)  =  /(x),  or  in  the  language 

Ax^O 

of  section  2.5,  that  /  is  continuous  at  x.  We  already  know  that  /'(x)  exists  (or  the  whole 
approach,  writing  the  derivative  of  fg  in  terms  of  /'  and  g' ,  doesn't  make  sense).  This 
turns  out  to  imply  that  /  is  continuous  as  well.  Here's  why: 

lim  /(x  +  Ax)  =  lim  (/(x  +  Ax)  —  /(x)  +  /(x)) 

Ax^O  Ax^O 

=  lim  /0^±M^/Max+  lim  /(X) 

Ax^O  Ax  Ax^O 

=  /'(x)-0  +  /(x)  =  /(x) 
To  summarize:  the  product  rule  says  that 

^(/(x)^(x))  =  f{x)g'{x)  +  f'{x)g{x). 

Returning  to  the  example  we  started  with,  let  /(x)  =  (x^  +  l)(x"^  —  3x).  Then  /'(x)  = 
(x^  +  l)(3x2  -  3)  +  (2x)(x^  -  3x)  =  3x^  -  3x'^  +  3x^-3  +  2x^  -  Gx^  =  5x^  -  Gx^  -  3, 
as  before.  In  this  case  it  is  probably  simpler  to  multiply  /(x)  out  first,  then  compute  the 
derivative;  here's  an  example  for  which  we  really  need  the  product  rule. 

EXAMPLE  3.3.1     Compute  the  derivative  of  /(x)  =  x^  v^625  —  x^.  We  have  already 

d    I   — X 

computed  —  v625  —  x^  =  =.  Now 

dx  \/625  -  x2 

2  ,0    f^r.  2      -x3  +  2x(625  -x2)      -3x3  +  1250x 

r  X  =  X  =  +  2xv  625  —  x^  =   ,   =    — . 

V625  -  x2  V625  -  x2  V625  -  x^ 


□ 


Exercises  3.3. 

In  1-4,  find  the  derivatives  of  the  functions  using  the  product  rule. 

1.  x^{x^  -  5x  +  10)  ^ 

2.  {x^  +  bx-  ^){x^  -  Qx^  +  Zx^  -  7a;  +  1)  ^ 


3.  v^\/625  -  a;2  ^ 
V625  - 

5.  Use  the  product  rule  to  compute  the  derivative  of  fix)  =  {2x  —  3)^.  Sketch  the  function. 
Find  an  equation  of  the  tangent  line  to  the  curve  at  x  =  2.  Sketch  the  tangent  line  at  x  =  2. 
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6.  Suppose  that  /,  g,  and  h  are  differentiable  functions.  Show  that  {fgh)'{x)  =  f  {x)g{x)h{x)  + 

f{x)g'{x)h{x)  +  f{x)g{x)h'{x). 

7.  State  and  prove  a  rule  to  compute  {fghi)'{x),  similar  to  the  rule  in  the  previous  problem. 

Product  notation.  Suppose  fi,  f2,  ■  ■  ■  fn  are  functions.  The  product  of  all  these  functions  can 
be  written 

n 

fc=i 

This  is  similar  to  the  use  of       to  denote  a  sum.  For  example, 

5 

n  /fe  =  /1/2/3/4/5 
k=l 

and 

n 

Ylk  =  1-2  -  ...■n  =  n\. 

k  =  l 

We  sometimes  use  somewhat  more  complicated  conditions;  for  example 

n 

n  A 

denotes  the  product  of  /i  through  /„  except  for  fj.  For  example, 

5 

nk  2        3        5  11 

X     =  X  ■  X    ■  X    ■  X    =  X  . 

fe=l,fe#4 

8.  The  generalized  product  rule  says  that  if  /i,  /2, .  • . ,  /«,  are  differentiable  functions  at  x 
then 

,      n  ^      /  ^  \ 

I: n^(-)=E     n  M-)]- 

k=l  3=1  \  k=l,k^j  J 

Verify  that  this  is  the  same  as  your  answer  to  the  previous  problem  when  n  =  4,  and  write 
out  what  this  says  when  n  =  5. 

3.4    The  Quotient  Rule 

What  is  the  derivative  of  {x^  +  l)/{x^  —  Sx)7  More  generally,  we'd  like  to  have  a  formula 
to  compute  the  derivative  of  f{x)/g{x)  if  we  already  know  f'{x)  and  g'{x).  Instead  of 
attacking  this  problem  head-on,  let's  notice  that  we've  already  done  part  of  the  problem: 
f{x)/g{x)  =  f{x)  ■  {l/g{x))^  that  is,  this  is  "really"  a  product,  and  we  can  compute  the 
derivative  if  we  know  f'{x)  and  {l/g{x)y.  So  really  the  only  new  bit  of  information  we 
need  is  {l/g{x)y  in  terms  of  g'{x).  As  with  the  product  rule,  let's  set  this  up  and  see  how 
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far  we  can  get: 


1      ^    ^.^     g{x+Ax)  gjx) 

dx  g{x)     Ax^o  Ax 

9(x)-gix+Ax) 

_  g{x+Ax)g{x) 

Ax^O  Ax 

^  ^.^  gjx)  -  gjx  +  Ax) 
Ax^o  g[x  +  Ax)g{x)Ax 

-  lim  ~  ^^^^ 


Ax^o  Ax  g{x  +  Ax)g{x) 


g{xY 

Now  we  can  put  this  together  with  the  product  rule: 

AM  =  f(x)^^  +  fix)—  =  =  nx)g{x)-f{x)g'{x)^ 
dx  g{x)             g{xY             g{x)                  g{xY  g{xY 

EXAMPLE  3.4.1     Compute  the  derivative  of  (x^  +  -  3x). 

d  x^  +  1  _  2x{x^  -  3x)  -  {x^  +  l){3x^  -  3)  _  -x^  -  Qx^  +  3 
dx  x^  —  3x  {x^  —  3x)'^  {x^  —  3x)'^  ^ 

It  is  often  possible  to  calculate  derivatives  in  more  than  one  way,  as  we  have  already 
seen.  Since  every  quotient  can  be  written  as  a  product,  it  is  always  possible  to  use  the 
product  rule  to  compute  the  derivative,  though  it  is  not  always  simpler. 

EXAMPLE  3.4.2   Find  the  derivative  of  V 625  —  x"^ l\fx  in  two  ways:  using  the  quotient 
rule,  and  using  the  product  rule. 
Quotient  rule: 


d  V625  -  a;2  _  v^(-a;/V625  -  x'^)  -  V625  -  a;2  •  1/(2^^) 
dx      \fx  X 

Note  that  we  have  used  \fx  =  x^l"^  to  compute  the  derivative  of  \fx  by  the  power  rule. 
Product  rule: 


dx  2 
With  a  bit  of  algebra,  both  of  these  simplify  to 

x^  +  625 


2V625  -  a;2a;3/2 " 

□ 
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Occasionally  you  will  need  to  compute  the  derivative  of  a  quotient  with  a  constant 
numerator,  like  Of  course  you  can  use  the  quotient  rule,  but  it  is  usually  not  the 

easiest  method.  If  we  do  use  it  here,  we  get 

d  10  _  x2  ■  0  -  10  ■  2x  _  -20 

dx  x'^  x^  ' 

since  the  derivative  of  10  is  0.  But  it  is  simpler  to  do  this: 

10  d 


dx  x"^  dx 


10x"2  =  -20x"^ 


Admittedly,  is  a  particularly  simple  denominator,  but  we  will  see  that  a  similar  calcu- 
lation is  usually  possible.  Another  approach  is  to  remember  that 

d    1    ^  ~g'{x) 
dx  g{x)       g{xY  ' 

but  this  requires  extra  memorization.  Using  this  formula, 

10  _  ^Q-2x 

dx  x'^  x^ 

Note  that  we  first  use  linearity  of  the  derivative  to  pull  the  10  out  in  front. 
Exercises  3.4- 

Find  the  derivatives  of  the  functions  in  1-4  using  the  quotient  rule. 

+  5x  —  3  . 


—  5x  +  10  x^  —  6x'^  +  3x^  —  7x  +  1 

X  .     \/625  —  x'^ 


^625" 


X 


2 


5.  Find  an  equation  for  the  tangent  line  to  f{x)  =  {x^  —  4)/(5  —  x)  at  x  =  3.  =^> 

6.  Find  an  equation  for  the  tangent  hne  to  f{x)  =  {x  —  2)/{x^  +  4x  —  1)  at  x  =  1.  =^> 

7.  Let  P  be  a  polynomial  of  degree  n  and  let  Q  be  a  polynomial  of  degree  m  (with  Q  not  the 
zero  polynomial).  Using  sigma  notation  we  can  write 

n  m 

-P  =  ^afex'',        Q  =  bkx^. 

fe=0  fe=0 

Use  sigma  notation  to  write  the  derivative  of  the  rational  function  P/Q. 

8.  The  curve  y  =  1/(1  +  x'^)  is  an  example  of  a  class  of  curves  each  of  which  is  called  a  witch 
of  Agnesi.  Sketch  the  curve  and  find  the  tangent  line  to  the  curve  at  x  =  5.  (The  word 
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witch  here  is  a  mistranslation  of  the  original  Italian,  as  described  at 

http: //mathworld. wolf ram. com/WitchofAgnesi .html 

and 

http : / /instructional! . calstatela . edu/sgray/Agnesi/ 
WitchHistory/Historyn amewitch.html.) 

9.  If  /(4)  =  5,  g'{A)  =  12,  {fg){A)  =  /(4)<7(4)  =  2,  and  g{A)  =  6,  compute  /(4)  and  at  4. 


3.5    The  Chain  Rule 

So  far  we  have  seen  how  to  compute  the  derivative  of  a  function  built  up  from  other 
functions  by  addition,  subtraction,  multiplication  and  division.  There  is  another  very 
important  way  that  we  combine  simple  functions  to  make  more  complicated  functions: 
function  composition,  as  discussed  in  section  2.3.  For  example,  consider  a/ 625  —  x^.  This 
function  has  many  simpler  components,  like  625  and  x^,  and  then  there  is  that  square  root 
symbol,  so  the  square  root  function  ^/x  =  x^^'^  is  involved.  The  obvious  question  is:  can 
we  compute  the  derivative  using  the  derivatives  of  the  constituents  625  —  x'^  and  y/x?  We 
can  indeed.  In  general,  if  f{x)  and  g{x)  are  functions,  we  can  compute  the  derivatives  of 
f{g{x))  and  g{f{x))  in  terms  of  f'{x)  and  g'{x). 


EXAMPLE  3.5.1     Form  the  two  possible  compositions  of  f{x)  =  \fx  and  g{x 


625  —  x^  and  compute  the  derivatives.  First,  f{g{x))  =  v625  — x^,  and  the  derivative 


is  —x/ V  625  —  as  we  have  seen.  Second,  g{f{x))  =  625  —  {y/x)"^  =  625  —  x  with  deriva- 
tive —  1.  Of  course,  these  calculations  do  not  use  anything  new,  and  in  particular  the 
derivative  of  f{g{x))  was  somewhat  tedious  to  compute  from  the  definition.  □ 

Suppose  we  want  the  derivative  of  f{g{x)).  Again,  let's  set  up  the  derivative  and  play 
some  algebraic  tricks: 

d  ff  f           r      f{9{x  +  Ax))  -  f{g{x)) 
-f{g{x))  =  lun^  

f{g{x  +  Ax))  -  f{g{x))  g{x  +  Ax))  -  g{x) 


=  lim 


Aa::^.0       g{x  +  Ax))  —  g{x)  Ax 

Now  we  see  immediately  that  the  second  fraction  turns  into  g'{x)  when  we  take  the  limit. 
The  first  fraction  is  more  complicated,  but  it  too  looks  something  like  a  derivative.  The 
denominator,  g{x  +  Ax))  —  g{x),  is  a  change  in  the  value  of  g,  so  let's  abbreviate  it  as 
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=  g{x  +  Ax))  —  g{x),  which  also  means  g{x  +  Ax)  =  g{x)  +  Ag.  This  gives  us 

^.^  fjgjx)  +  Ag)  -  fjgjx))  ^ 
Ax-^o  Ag 

As  Ax  goes  to  0,  it  is  also  true  that  Ag  goes  to  0,  because  g{x  +  Ax)  goes  to  g{x).  So  we 
can  rewrite  this  limit  as 

^.^  f{g{x)  +  Ag)-f{g{x))^ 

Ag-)-0  A^r 

Now  this  looks  exactly  like  a  derivative,  namely  f'{g{x)),  that  is,  the  function  f'{x)  with 
X  replaced  by  g{x).  If  this  all  withstands  scrutiny,  we  then  get 

^/(^(^))  =  f'{9{x))g'{x). 

Unfortunately,  there  is  a  small  flaw  in  the  argument.  Recall  that  what  we  mean  by  limAx-).o 
involves  what  happens  when  Ax  is  close  to  0  but  not  equal  to  0.  The  qualification  is  very 
important,  since  wc  must  be  able  to  divide  by  Ax.  But  when  Aa;  is  close  to  0  but  not  equal 
to  0,  Ag  —  g{x  +  Ax))  —  g{x)  is  close  to  0  and  possibly  equal  to  0.  This  means  it  doesn't 
really  make  sense  to  divide  by  A^f.  Fortunately,  it  is  possible  to  recast  the  argument  to 
avoid  this  difficulty,  but  it  is  a  bit  tricky;  we  will  not  include  the  details,  which  can  be 
found  in  many  calculus  books.  Note  that  many  functions  g  do  have  the  property  that 
g{x  +  Ax)  —  g{x)  ^  0  when  Ax  is  small,  and  for  these  functions  the  argument  above  is 
fine. 

The  chain  rule  has  a  particularly  simple  expression  if  we  use  the  Leibniz  notation  for 
the  derivative.  The  quantity  f'{g{x))  is  the  derivative  of  /  with  x  replaced  by  g;  this  can 
be  written  df/dg.  As  usual,  g'{x)  =  dg/dx.  Then  the  chain  rule  becomes 

df^  ^  dj^d^ 
dx      dg  dx 

This  looks  like  trivial  arithmetic,  but  it  is  not:  dg/dx  is  not  a  fraction,  that  is,  not  literal 
division,  but  a  single  symbol  that  means  g'{x).  Nevertheless,  it  turns  out  that  what  looks 
like  trivial  arithmetic,  and  is  therefore  easy  to  remember,  is  really  true. 

It  will  take  a  bit  of  practice  to  make  the  use  of  the  chain  rule  come  naturally — it  is 
more  complicated  than  the  earlier  differentiation  rules  we  have  seen. 

EXAMPLE  3.5.2  Compute  the  derivative  of  \/625  -  a;^.  We  already  know  that  the 
answer  is  —x/ a/ 625  —  x"^,  computed  directly  from  the  limit.  In  the  context  of  the  chain 
rule,  we  have  f{x)  =  ^/x,  g{x)  =  625  —  x^.    We  know  that  f'{x)  =  {l/2)x~^^^,  so 
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f'{g{x))  =  (l/2)(625  —  x'^)  ^1"^ .  Note  that  this  is  a  two  step  computation:  first  compute 
f'{x),  then  replace  x  by  g{x).  Since  g'{x)  =  —2x  we  have 

f'igixWix)  =  ,  J,  J-2x) 


2V625  -  x2  \/625  -  x"^ 

□ 

EXAMPLE  3.5.3  Compute  the  derivative  of  l/-\/625  -  x'^.  This  is  a  quotient  with  a 
constant  numerator,  so  we  could  use  the  quotient  rule,  but  it  is  simpler  to  use  the  chain 
rule.  The  function  is  (625  — a;^)~^/^,  the  composition  of  f{x)  =  x~^^'^  and  g{x)  =  625  — x^. 
We  compute  f'{x)  =  {—l/2)x~^^'^  using  the  power  rule,  and  then 

f{g{x))g'{x)  =  ^~\,,,,J-2x) 


2(625  -a;2)3/2^       '      {625 -x^)^/^' 

□ 

In  practice,  of  course,  you  will  need  to  use  more  than  one  of  the  rules  we  have  developed 
to  compute  the  derivative  of  a  complicated  function. 

EXAMPLE  3.5.4    Compute  the  derivative  of 

x^  -1 


V'a;2  +  1 " 


X 

The  "last"  operation  here  is  division,  so  to  get  started  we  need  to  use  the  quotient  rule 
first.  This  gives 

.{x^-  i)'xV^2qri  _  (a;2  _  i)(a;^/^2qri)/ 

^       ~  x'^ix^  +  l) 

_  2x^V^^^  -  {x^  -  l){xV^^^^y 
~  a;2(a;2  +  l) 

Now  we  need  to  compute  the  derivative  of  xy^ x'^  +  1.  This  is  a  product,  so  we  use  the 
product  rule: 

-^x\/  x"^  +  1  =  x-^\/x'^  +  1  +  \/x^  +  1. 
ax  ax 

Finally,  we  use  the  chain  rule: 

dx"  dx^         '         2^         )      \    )  V^sqn 
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And  putting  it  all  together: 


f  (^)  =  


x'^{x'^  +  1) 


This  can  be  simplified  of  course,  but  we  have  done  all  the  calculus,  so  that  only  algebra  is 
left.  □ 


EXAMPLE  3.5.5  Compute  the  derivative  of  y  1  +  y  1  +  ^/x.  Here  we  have  a  more 
complicated  chain  of  compositions,  so  we  use  the  chain  rule  twice.  At  the  outermost  "layer" 

we  have  the  function  g(x)  =  1  +  y^'l  +  \/x  plugged  into  f{x)  —  \/x,  so  applying  the  chain 
rule  once  gives 


d  I      r   1         r  -\-"'  d 


Now  we  need  the  derivative  of  y  1  +  \/x.  Using  the  chain  rule  again: 


So  the  original  derivative  is 


ifT^.  =  i  (i  +  \  (1  +  vs)-"^ 


1 


□ 


Using  the  chain  rule,  the  power  rule,  and  the  product  rule,  it  is  possible  to  avoid  using 
the  quotient  rule  entirely. 
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EXAMPLE  3.5.6    Compute  the  derivative  of /(x)  =      ^  ^  .  Write /(x)  =  + 
then 

f'(x)  =  x^-^fx^  +        +  3x2(x2  + 
dx 

=  x^(-l)(x2  +  l)"^(2x)  +  3x^{x^  +  1)"^ 

^_2x4(x2  +  l)-2+3x2(x2  +  l)-l 

-2x4  3x2 

+ 


(x2  +  l)2       x2  +  l 

-2x'^        3x2(x2  +  ll 


(x2  +  l)2  (x2  +  l)2 

_  -2x4  +  3x4  ^       ^  ^ 

-  (x2  +  l)2  ~  (x2  +  l)2 

Note  that  we  aheady  had  the  derivative  on  the  second  hne;  aU  the  rest  is  simphfication.  It 
is  easier  to  get  to  this  answer  by  using  the  quotient  rule,  so  there's  a  trade  off:  more  work 
for  fewer  memorized  formulas.  □ 

Exercises  3.5. 

Find  the  derivatives  of  the  functions.  For  extra  practice,  and  to  check  your  answers,  do  some  of 
these  in  more  than  one  way  if  possible. 

1.  x'^ -2,x^ +  {l/2)x^ +  7X-TT  ^  2.  x^-2x^  +  A^/i^ 

3.  {x^  +  lf^  4.  x\/im  -  z2  ^ 

5.  {x^  —  Ax  +  5)-\/25  —  x'^  =^  6.  \/ r"^  —  x^,  r  is  a  constant 

7.  V^  +  x^  8.  —2= 

9.    (1  +  3x)^  10. 


{x'^  +  X  +  1) 

_     V25  -  x^  _  /l69 

11.   12.    \  x  = 


X  \  x 

13.    ^a;3  -  x2  -  (1/x)  ^  14.    100/(100  -  x^)^/^ 


15.          +  a;3  ^  16.  y  (a;2  +  1)2  +  ^l  +  (^2  +  1)2 

17.  (x  +  S)^^  18.  (4-x)^^ 

19.  (a;^+5)^^  20.  (6  -  2x^f  ^ 

21.  {1-Ax^y^^  22.  5(a;  + 1  -  l/x)  ^ 

23.  4{2x'^  -  X  +  ^  24. 


1  +  1/x 


25.    -^^  1  7^  26.    (x^  +  1)(5  -  2a;)/2 

4a;2-2x  +  l  v     ^  A  ;/ 
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27.    (33;2  +  l)(2a;-4)3  ^  28.  ^ 

X  —  1 

^r.  -\  (a;-l)(x-2) 
29.    ^—  ^  30.    ^  '\  

a;^  +  1  X  —  i 


2x-^  - 
3a;-i  -  4a;-2 


31-   J—^^  32.   3(x^  +  l)(2a;^  -  l)(2x  +  3) 


35.  (2a;  + l)^(a;^  +  1)^  ^ 

36.  Find  an  equation  for  the  tangent  line  to  f{x)  =  {x  —  2)^^^/(x^  +  4x  —  1)^  at  x  =  1. 

37.  Find  an  equation  for  the  tangent  line  to  y  =  9x^^  at  (3, 1). 

38.  Find  an  equation  for  the  tangent  line  to  {x^  —  4x  +  5)\/25  —  x-^  at  (3,  8). 

(x^  +  X  +  1) 

39.  Find  an  equation  for  the  tangent  line  to  —  r —  at  (2,  —7).  =^ 


40.  Find  an  equation  for  the  tangent  line  to  y  (a;^  +  1)^  +  \/l  +  {x'^  +  l)-^  at  (1,  y  4  +  \/5) 


4 

Transcendental  Functions 


So  far  we  have  used  only  algebraic  functions  as  examples  when  finding  derivatives,  that  is, 
functions  that  can  be  built  up  by  the  usual  algebraic  operations  of  addition,  subtraction, 
multiplication,  division,  and  raising  to  constant  powers.  Both  in  theory  and  practice  there 
are  other  functions,  called  transcendental,  that  are  very  useful.  Most  important  among 
these  are  the  trigonometric  functions,  the  inverse  trigonometric  functions,  exponential 
functions,  and  logarithms. 

4.1    Trigonometric  Functions 

When  you  first  encountered  the  trigonometric  functions  it  was  probably  in  the  context  of 
"triangle  trigonometry,"  defining,  for  example,  the  sine  of  an  angle  as  the  "side  opposite 
over  the  hypotenuse."  While  this  will  still  be  useful  in  an  informal  way,  we  need  to  use  a 
more  expansive  definition  of  the  trigonometric  functions.  First  an  important  note:  while 
degree  measure  of  angles  is  sometimes  convenient  because  it  is  so  familiar,  it  turns  out  to 
be  ill-suited  to  mathematical  calculation,  so  (almost)  everything  we  do  will  be  in  terms  of 
radian  measure  of  angles. 


73 
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To  define  the  radian  measurement  system,  we  consider  the  unit  circle  in  the  xy-plaxie: 


:os  a;,  sin  a;) 

(1,0) 

An  angle,  a;,  at  the  center  of  the  circle  is  associated  with  an  arc  of  the  circle  which  is  said 
to  subtend  the  angle.  In  the  figure,  this  arc  is  the  portion  of  the  circle  from  point  (1,0) 
to  point  A.  The  length  of  this  arc  is  the  radian  measure  of  the  angle  x\  the  fact  that  the 
radian  measure  is  an  actual  geometric  length  is  largely  responsible  for  the  usefulness  of 
radian  measure.  The  circumference  of  the  unit  circle  is  27rr  =  27r(l)  =  27r,  so  the  radian 
measure  of  the  full  circular  angle  (that  is,  of  the  360  degree  angle)  is  27r. 

While  an  angle  with  a  particular  measure  can  appear  anywhere  around  the  circle,  we 
need  a  fixed,  conventional  location  so  that  we  can  use  the  coordinate  system  to  define 
properties  of  the  angle.  The  standard  convention  is  to  place  the  starting  radius  for  the 
angle  on  the  positive  x-axis,  and  to  measure  positive  angles  counterclockwise  around  the 
circle.  In  the  figure,  x  is  the  standard  location  of  the  angle  7r/6,  that  is,  the  length  of  the 
arc  from  (1,  0)  to  A  is  7r/6.  The  angle  y  in  the  picture  is  — 7r/6,  because  the  distance  from 
(1,0)  to  B  along  the  circle  is  also  7r/6,  but  in  a  clockwise  direction. 

Now  the  fundamental  trigonometric  definitions  are:  the  cosine  of  x  and  the  sine  of  x 
are  the  first  and  second  coordinates  of  the  point  A^  as  indicated  in  the  figure.  The  angle  x 
shown  can  be  viewed  as  an  angle  of  a  right  triangle,  meaning  the  usual  triangle  definitions 
of  the  sine  and  cosine  also  make  sense.  Since  the  hypotenuse  of  the  triangle  is  1,  the  "side 
opposite  over  hypotenuse"  definition  of  the  sine  is  the  second  coordinate  of  point  A  over 
1,  which  is  just  the  second  coordinate;  in  other  words,  both  methods  give  the  same  value 
for  the  sine. 

The  simple  triangle  definitions  work  only  for  angles  that  can  "fit"  in  a  right  triangle, 
namely,  angles  between  0  and  7r/2.  The  coordinate  definitions,  on  the  other  hand,  apply 
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to  any  angles,  as  indicated  in  this  figure: 


The  angle  x  is  subtended  by  the  heavy  arc  in  the  figure,  that  is,  x  —  Ttt/G.  Both 
coordinates  of  point  A  in  this  figure  are  negative,  so  the  sine  and  cosine  of  77r/6  are  both 
negative. 

The  remaining  trigonometric  functions  can  be  most  easily  defined  in  terms  of  the  sine 
and  cosine,  as  usual: 

sin  a; 
tana;  =   

cos  X 

cos  X 

cot  X  =  

smx 

1 

sec  a;  =   

cos  a; 

1 

CSC  a;  =   

sm,x 

and  they  can  also  be  defined  as  the  corresponding  ratios  of  coordinates. 

Although  the  trigonometric  functions  are  defined  in  terms  of  the  unit  circle,  the  unit 
circle  diagram  is  not  what  we  normally  consider  the  graph  of  a  trigonometric  function. 
(The  unit  circle  is  the  graph  of,  well,  the  circle.)  We  can  easily  get  a  qualitatively  correct 
idea  of  the  graphs  of  the  trigonometric  functions  from  the  unit  circle  diagram.  Consider 
the  sine  function,  y  —  sin  a;.  As  x  increases  from  0  in  the  unit  circle  diagram,  the  second 
coordinate  of  the  point  A  goes  from  0  to  a  maximum  of  1,  then  back  to  0,  then  to  a 
minimum  of  —1,  then  back  to  0,  and  then  it  obviously  repeats  itself.  So  the  graph  of 
y  =  sin  a;  must  look  something  like  this: 
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Similarly,  as  angle  x  increases  from  0  in  the  unit  circle  diagram,  the  first  coordinate  of 
the  point  A  goes  from  1  to  0  then  to  —1,  back  to  0  and  back  to  1,  so  the  graph  of  y  =  cosx 
must  look  something  like  this: 


2tt 

—  TT 

-1  - 

7r/2\ 

TT 

27r 

Exercises  ^.i. 

Some  useful  trigonometric  identities  are  in  appendix  B. 

1.  Find  all  values  of  6  such  that  sin(0)  =  —1;  give  your  answer  in  radians. 

2.  Find  all  values  of  6  such  that  cos(20)  =  1/2;  give  your  answer  in  radians.  =^> 

3.  Use  an  angle  sum  identity  to  compute  cos(7r/12).  =^ 

4.  Use  an  angle  sum  identity  to  compute  tan(57r/12). 

5.  Verify  the  identity  cos^(t)/(l  —  sin(t))  =  1  +  sin(t). 

6.  Verify  the  identity  2csc(20)  =  sec{9)  csc{6). 

7.  Verify  the  identity  sin(36')  -  sin(6l)  =  2  cos(26')  sin(6'). 

8.  Sketch  y  =  2sin(a;). 

9.  Sketch  y  =  sin(3a;). 

10.  Sketch  y  =  sin(—a;). 

11.  Find  all  of  the  solutions  of  2  sin(t)  —  1  —  sin^(i)  =  0  in  the  interval  [0,  27r].  =^> 


4.2    The  Derivative  of  since 

What  about  the  derivative  of  the  sine  function?  The  rules  for  derivatives  that  we  have  are 
no  help,  since  sinx  is  not  an  algebraic  function.  We  need  to  return  to  the  definition  of  the 
derivative,  set  up  a  limit,  and  try  to  compute  it.  Here's  the  definition: 

d  sin(x  +  Ax)  —  sinx 

—  smx=  hm  . 

ax  Ax^o  Ax 

Using  some  trigonometric  identities,  we  can  make  a  little  progress  on  the  quotient: 

sin(x  +  Ax)  —  sin  x      sin  x  cos  Ax  +  sin  Ax  cos  x  —  sin  x 

Ax  Ax 

cos  Ax  —  1  sin  Ax 

=  sm  X  T  h  cos  X  — T  . 

Ax  Ax 
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This  isolates  the  difficult  bits  in  the  two  limits 

cos  Ax  —  1                     sin  Ax 
lim   and      lim  —  . 

Ax^O         Ax  Ax^O  Ax 

Here  we  get  a  little  lucky:  it  turns  out  that  once  we  know  the  second  limit  the  first  is  quite 
easy.  The  second  is  quite  tricky,  however.  Indeed,  it  is  the  hardest  limit  we  will  actually 
compute,  and  we  devote  a  section  to  it. 

4.3     A  HARD  LIMIT 

We  want  to  compute  this  limit: 

sin  Ax 
lim  —  . 

Ax 

Equivalently,  to  make  the  notation  a  bit  simpler,  we  can  compute 

since 
lim  . 

x^O  X 

In  the  original  context  we  need  to  keep  x  and  Ax  separate,  but  here  it  doesn't  hurt  to 
rename  Ax  to  something  more  convenient. 

To  do  this  we  need  to  be  quite  clever,  and  to  employ  some  indirect  reasoning.  The 
indirect  reasoning  is  embodied  in  a  theorem,  frequently  called  the  squeeze  theorem. 

THEOREM  4.3.1  Squeeze  Theorem  Suppose  that  g{x)  <  /(x)  <  h{x)  for  all  x 
close  to  a  but  not  equal  to  a.  If  limaj^.^  (/(x)  =  L  =  lima;^.^  h{x),  then  lim^^a  f{x)  =  L. 

■ 

This  theorem  can  be  proved  using  the  official  definition  of  limit.  We  won't  prove  it 
here,  but  point  out  that  it  is  easy  to  understand  and  believe  graphically.  The  condition 
says  that  /(x)  is  trapped  between  g{x)  below  and  h{x)  above,  and  that  at  x  =  a,  both  g 
and  h  approach  the  same  value.  This  means  the  situation  looks  something  like  figure  4.3.1. 
The  wiggly  curve  is  x^sin(7r/x),  the  upper  and  lower  curves  are  x^  and  —x'^.  Since  the 
sine  function  is  always  between  —1  and  1,  — x^  <  x^sin(7r/x)  <  x^,  and  it  is  easy  to  see 
that  lim2;^.o  =  0  =  \im.j:^ox^.  It  is  not  so  easy  to  see  directly,  that  is  algebraically, 
that  lim2:^.o  sin(7r/x)  =  0,  because  the  tt/x  prevents  us  from  simply  plugging  in  x  =  0. 
The  squeeze  theorem  makes  this  "hard  limit"  as  easy  as  the  trivial  limits  involving  x^. 

To  do  the  hard  limit  that  we  want,  limj;^o(sinx)/x,  we  will  find  two  simpler  functions 
g  and  h  so  that  g{x)  <  (sinx)/x  <  /i(x),  and  so  that  limj:^o  g{x)  =  lim2;^.o  ^(3^)-  Not  too 
surprisingly,  this  will  require  some  trigonometry  and  geometry.  Referring  to  figure  4.3.2, 
X  is  the  measure  of  the  angle  in  radians.  Since  the  circle  has  radius  1,  the  coordinates  of 
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Figure  4.3.1     The  squeeze  theorem. 


point  A  are  (cosx,  sinx),  and  the  area  of  the  smaU  triangle  is  (cos  x  sin  ,t)/2.  This  triangle 
is  completely  contained  within  the  circular  wedge-shaped  region  bordered  by  two  lines  and 
the  circle  from  (1,0)  to  point  A.  Comparing  the  areas  of  the  triangle  and  the  wedge  we 
see  (cosa:sinx)/2  <  x/2,  since  the  area  of  a  circular  region  with  angle  9  and  radius  r  is 
9r^/2.  With  a  little  algebra  this  turns  into  (sina;)/a;  <  1/  cosx,  giving  us  the  h  we  seek. 


Figure  4.3.2     Visualizing  sinx/x. 

To  find  g,  we  note  that  the  circular  wedge  is  completely  contained  inside  the  larger 
triangle.  The  height  of  the  triangle,  from  (1, 0)  to  point  B,  is  tanx,  so  comparing  areas  we 
get  x/2  <  (tana;)/2  =  sin  a;/ (2  cos  a;).  With  a  little  algebra  this  becomes  cosx  <  (sina;)/a;. 
So  now  we  have 

sinx  1 

cos  X  <   <   . 

X  cosx 
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Finally,  the  two  limits  lim2;^.o  cos  x  and  lim2:^.o  1/ cos  a;  are  easy,  because  cos(O)  =  1.  By 
the  squeeze  theorem,  lim2;^.o(sina;)/a;  =  1  as  well. 

Before  we  can  complete  the  calculation  of  the  derivative  of  the  sine,  we  need  one  other 
limit: 

, .     cos  X  —  1 
lim   . 

x^O  X 

This  limit  is  just  as  hard  as  sinx/x,  but  closely  related  to  it,  so  that  we  don't  have  to  a 
similar  calculation;  instead  we  can  do  a  bit  of  tricky  algebra. 


cos  X  —  1      cos  X  —  1  cos  X  +  1       cos  X  —  1 


—  sm  X 


X 


X 


cosx  +  1      x(cosx  +  l)      x(cosx  +  ll 


sm  X     sm  X 
X    cos  X  +  1 


To  compute  the  desired  limit  it  is  sufficient  to  compute  the  limits  of  the  two  final  fractions, 
as  X  goes  to  0.  The  first  of  these  is  the  hard  limit  we've  just  done,  namely  1.  The  second 
turns  out  to  be  simple,  because  the  denominator  presents  no  problem: 


smx 


sinO 


0 


lim  — - — - —  =  — - — - —  =  -  =  0. 
x^o  COS  X  +  1      COS  0  +  1  2 


Thus, 


cosx  —  1 
lim   

x^O  X 


Exercises  4-3- 


1.    Compute  lim 


3.   Compute  lim 


cot(4a;) 


x^o  csc(3x) 

sin  X  —  cos  X 


„     „  ,.  sin(7x 

2.    Compute  lim  — 


4.    Compute  lim 


x^o  sin(2x) 
tan  X 


x^Q  X 


5.  Compute  lim 

x^-K/i  cos(2a;j 

6.  For  all  X  >  0,  4a;  -  9  <  f{x)  <x^-4x  +  7.  Find  lim  f{x). 

7.  For  all  x,  2x  <  g{x)  <  a:^  —      +  2.  Find  lim  g{x). 

8.  Use  the  Squeeze  Theorem  to  show  that  lim      cos(2/x)  =  0. 

x—^-O 
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4.4    The  Derivative  of  sinx,  continued 

Now  we  can  complete  the  calculation  of  the  derivative  of  the  sine: 


d 

dx 


sinx  =  lim 

Ax^O 


sin  (a;  +  Ax)  —  sinx 


=  lim  sin  x 

Ax^O 


Ax 

cos  Ax  —  1 


sin  Aa; 


+  cos  a; 


Ax 


=  sin  a;  •  0  +  cos  x  ■  1  =  cos  x. 


The  derivative  of  a  function  measures  the  slope  or  steepness  of  the  function;  if  we 
examine  the  graphs  of  the  sine  and  cosine  side  by  side,  it  should  be  that  the  latter  appears 
to  accurately  describe  the  slope  of  the  former,  and  indeed  this  is  true: 


Notice  that  where  the  cosine  is  zero  the  sine  does  appear  to  have  a  horizontal  tangent 
line,  and  that  the  sine  appears  to  be  steepest  where  the  cosine  takes  on  its  extreme  values 
of  1  and  —1. 

Of  course,  now  that  we  know  the  derivative  of  the  sine,  we  can  compute  derivatives  of 
more  complicated  functions  involving  the  sine. 

EXAMPLE  4.4.1     Compute  the  derivative  of  sin(a;^). 


-1  - 


-1  - 


sin  X 


cos  X 


d 

dx 


sin(a;^)  =  cos(a;^)  •  2x  —  2a;cos(a;^). 


□ 


example  4.4.2    Compute  the  derivative  of  sin^(a;^  -  5a;). 


d 

dx 


sin^(a;^  —  5a;) 


^(sin(a;3-5a;))2 

dx 

2(sin(a;^  —  5x))^  cos(x"^  —  5x){3x'^  —  5) 
2(3a;^  —  5)  cos(a;^  —  5a;)  sin(a;^  —  5a;). 


□ 
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Exercises  4-4 • 

Find  the  derivatives  of  the  following  functions. 

1.    sin'^(v^)  =^  2.  \/xsmx 

o        1  .     x'^  +  X 

3.   4.    —   = 


sm  X  sm  X 

sin^  X  =^ 


5.  Vl 


4.5    Derivatives  of  the  Trigonometric  Functions 

All  of  the  other  trigonometric  functions  can  be  expressed  in  terms  of  the  sine,  and  so  their 
derivatives  can  easily  be  calculated  using  the  rules  we  already  have.  For  the  cosine  we 
need  to  use  two  identities, 

COS X  =  sm[x  H — J, 
2 

smx  =  —  cos[x  +  —). 

Now: 

d  d  Tc  n 

—  cosx  =  —  sm(x  H — I  =  cosix  H — I  ■  1  =  —  smx 
dx  dx      ^       2'  ^2' 

d  d  sin  X      cos^  x  +  sin^  x         1  9 

—  tan  X  —  =   7,  =  — 7. —  =  sec  x 

dx  dx  cos  X  cos^  x  cos^  x 

secx  =  4~(cosx)~^  =  — l(cosx)~^(—  sinx)  =      ^  =  secxtanx 
dx  dx  cos^  X 

The  derivatives  of  the  cotangent  and  cosecant  are  similar  and  left  as  exercises. 
Exercises  4-5- 

Find  the  derivatives  of  the  following  functions. 

1.   sina;cosx=^  2.   sin(cosa;)  =^ 

3.    V X  tan X  =^>  4.    tan         +  sinx) 

5.   cot  a;  6.   esc  a;  =^> 

7.   x^sin(23a;^)  =^>  8.   sin^  a;  +  cos^  x 

9.    sin(cos(6a;))  =^ 

,  „    „  d     sec  6 

10.  Compute 


dO  1  +  sec  6 


11.  Compute  -^t^cos(6i).  =^ 

^  d  t^sinrat) 

12.  Compute  t^t^-  ^ 

^       dt  cos(2t) 

13.  Find  all  points  on  the  graph  of  f{x)  =  sin^(a;)  at  which  the  tangent  line  is  horizontal. 
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14.  Find  all  points  on  the  graph  of  f{x)  =  2  sin(x)  — sin'^(a;)  at  which  the  tangent  line  is  horizontal. 

=^ 

15.  Find  an  equation  for  the  tangent  line  to  sin^(x)  at  a;  =  7r/3.  =^> 

16.  Find  an  equation  for  the  tangent  line  to  sec^  x  at  x  =  tt/S.  =^ 

17.  Find  an  equation  for  the  tangent  line  to  cos'^  x  —  sin'^(4x)  at  x  =  7r/6.  =^ 

18.  Find  the  points  on  the  curve  y  =  x  +  2cosx  that  have  a  horizontal  tangent  line. 

19.  Let  C  be  a  circle  of  radius  r.  Let  A  be  an  arc  on  C  subtending  a  central  angle  6.  Let  B  be 
the  chord  of  C  whose  endpoints  are  the  endpoints  of  A.  (Hence,  B  also  subtends  6.)  Let 
s  be  the  length  of  A  and  let  d  be  the  length  of  B.  Sketch  a  diagram  of  the  situation  and 
compute  lim  s/d. 

4.6    Exponential  and  Logarithmic  functions 

An  exponential  function  has  the  form  a^,  where  a  is  a  constant;  examples  are  2^,  10^,  e^. 
The  logarithmic  functions  are  the  inverses  of  the  exponential  functions,  that  is,  functions 
that  "undo"  the  exponential  functions,  just  as,  for  example,  the  cube  root  function  "un- 
does" the  cube  function:  =  2.  Note  that  the  original  function  also  undoes  the  inverse 
function:  {^)^  =  8. 

Let  f{x)  =  2^.  The  inverse  of  this  function  is  called  the  logarithm  base  2,  denoted 
log2(x)  or  (especially  in  computer  science  circles)  lg(a;).  What  does  this  really  mean?  The 
logarithm  must  undo  the  action  of  the  exponential  function,  so  for  example  it  must  be  that 
lg(2'^)  =  3 — starting  with  3,  the  exponential  function  produces  2^  =  8,  and  the  logarithm 
of  8  must  get  us  back  to  3.  A  little  thought  shows  that  it  is  not  a  coincidence  that  lg(2^) 
simply  gives  the  exponent — the  exponent  is  the  original  value  that  we  must  get  back  to. 
In  other  words,  the  logarithm  is  the  exponent.  Remember  this  catchphrase,  and  what  it 
means,  and  you  won't  go  wrong.  (You  do  have  to  remember  what  it  means.  Like  any 
good  mnemonic,  "the  logarithm  is  the  exponent"  leaves  out  a  lot  of  detail,  like  "Which 
exponent?"  and  "Exponent  of  what?") 

EXAMPLE  4.6.1  What  is  the  value  of  logio(lOOO)?  The  "10"  tells  us  the  appropriate 
number  to  use  for  the  base  of  the  exponential  function.  The  logarithm  is  the  exponent, 
so  the  question  is,  what  exponent  E  makes  10^  =  1000?  If  we  can  find  such  an  E,  then 
logj^o(lOOO)  =  logio(lO^)  —  ^'t  finding  the  appropriate  exponent  is  the  same  as  finding  the 
logarithm.  In  this  case,  of  course,  it  is  easy:  i?  =  3  so  log j^g  (1000)  =  3.  □ 

Let's  review  some  laws  of  exponents  and  logarithms;  let  a  be  a  positive  number.  Since 
=  a  -  a-  a  -  a  -  a  and      =  a  -  a-  a,  it's  clear  that  a^-a"^  =  a-a-a-a-a-a-a-a  =  a^  =  a^~^^, 
and  in  general  that  a^a^  =  a"^^".  Since  "the  logarithm  is  the  exponent,"  it's  no  surprise 
that  this  translates  directly  into  a  fact  about  the  logarithm  function.  Here  are  three  facts 
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from  the  example:  log„(a^)  =  5,  log^{a^)  =  3,  log„(a^)  =  8.  So  \og^{a^a^)  =  log^{a^)  = 
8  =  5  +  3  =  log^(a^)  +  log^(a^).  Now  let's  make  this  a  bit  more  general.  Suppose  A  and 
B  are  two  numbers,  A  =  a^,  and  B  =  a^.  Then  \ogg^{AB)  =  log„(a^a^)  =  log^(a^"''^)  = 
x  +  y  =  \og^iA)  +  \og,{B). 

Now  consider  (a^)^  =  ■  a''  ■  a''  =  a^+^+^  =  a^'^  =  a^^ .  Again  it's  clear  that  more 
generally  (a"^)"^  =  a"*"^,  and  again  this  gives  us  a  fact  about  logarithms.  If  A  =  a"^  then 
Ay  =  (a^)y  =  a^y,  so  logg^{Ay)  =  xy  =  ylog^(yl) — the  exponent  can  be  "pulled  out  in 
front." 

We  have  cheated  a  bit  in  the  previous  two  paragraphs.  It  is  obvious  that  =  a-a-a-a-a 
and  =  a  -  a  -  a  and  that  the  rest  of  the  example  follows;  likewise  for  the  second  example. 
But  when  we  consider  an  exponential  function  we  can't  be  limited  to  substituting 
integers  for  x.  What  does  a^'^  or  a~^-^  or  a'^  mean?  And  is  it  really  true  that  a'^-^a~^-^  = 
^2.5-1.37  rpj^g  answer  to  the  first  question  is  actually  quite  difficult,  so  we  will  evade  it; 
the  answer  to  the  second  question  is  "yes." 

We'll  evade  the  full  answer  to  the  hard  question,  but  we  have  to  know  something  about 
exponential  functions.  You  need  first  to  understand  that  since  it's  not  "obvious"  what  2^ 
should  mean,  we  are  really  free  to  make  it  mean  whatever  we  want,  so  long  as  we  keep  the 
behavior  that  is  obvious,  namely,  when  a;  is  a  positive  integer.  What  else  do  we  want  to 
be  true  about  2^?  We  want  the  properties  of  the  previous  two  paragraphs  to  be  true  for 
all  exponents:  2^2^  =  2^+^  and  (2^)^  =  2^^. 

After  the  positive  integers,  the  next  easiest  number  to  understand  is  0:  2*^  =  1.  You 
have  presumably  learned  this  fact  in  the  past;  why  is  it  true?  It  is  true  precisely  because 
we  want  2"2''  —  2"+^  to  be  true  about  the  function  2^.  We  need  it  to  be  true  that 
2^2^  =  2°+^  =  2^,  and  this  only  works  if  2°  =  1.  The  same  argument  implies  that  a°  =  1 
for  any  a. 

The  next  easiest  set  of  numbers  to  understand  is  the  negative  integers:  for  example, 
2~^  =  1/2^.  We  know  that  whatever  2~^  means  it  must  be  that  2~^2^  =  2~^+^  =  2°  =  1, 
which  means  that  must  be  1/2^.  In  fact,  by  the  same  argument,  once  we  know  what 
2^  means  for  some  value  of  x,  2~^  must  be  1/2^  and  more  generally  = 

Next,  consider  an  exponent  1/q,  where  q  is  a  positive  integer.  We  want  it  to  be  true 
that  (2^)f  =  2^^,  so  (2i/«)«  =  2.  This  means  that  2^/«  is  a  q-th  root  of  2,  2^/^  =  This 
is  all  we  need  to  understand  that  2p/i  =  (2^/i)p  =  (^)p  and  a^/^  =  (aV«)f  =  (1^)^. 

What's  left  is  the  hard  part:  what  does  2^  mean  when  x  cannot  be  written  as  a 
fraction,  like  x  =  -\/2~  or  a;  =  tt?  What  we  know  so  far  is  how  to  assign  meaning  to  2^ 


84     Chapter  4  Transcendental  Functions 

whenever  x  =  p/q;  if  we  were  to  graph  this  we'd  see  something  hke  this: 


But  this  is  a  poor  picture,  because  you  can't  see  that  the  "curve"  is  really  a  whole  lot 
of  individual  points,  above  the  rational  numbers  on  the  x-axis.  There  are  really  a  lot  of 
"holes"  in  the  curve,  above  x  =  tx,  for  example.  But  (this  is  the  hard  part)  it  is  possible 
to  prove  that  the  holes  can  be  "filled  in",  and  that  the  resulting  function,  called  2^,  really 
does  have  the  properties  we  want,  namely  that  2^2^  =  2^+^  and  (2^)^  =  2^^. 

Exercises  4-6. 

1.  Expandlogio((2.-  +  45)^(x-2)). 

2.  Expand  log. 


^  3x-5  +  (7/x)  ■ 

3.  Write  loga  3a;  +  171og2(x  —  2)  —  2  Iog2(a;^  +  4x  +  1)  as  a  single  logarithm. 


4.  Solve  log2(l  +  Vx)  =  6  for  x. 

5.  Solve  2"^^  =  8  for  x. 

6.  Solve  log2(log3(a;))  =  1  for  x. 


4.7    Derivatives  of  the  exponential  and 
logarithmic  functions 

As  with  the  sine,  we  don't  know  anything  about  derivatives  that  allows  us  to  compute 
the  derivatives  of  the  exponential  and  logarithmic  functions  without  going  back  to  basics. 
Let's  do  a  little  work  with  the  definition  again: 

d    ^      ,       a^+'^^  — 
-a   —  hm 


dx        Ax^o  Ax 

(X  (X  ~ 

=  hm 


Ax-^O  Ax 

hm  a  - 


Ax^O  Ax 

X  a^'^-l 

a    hm  —  

Ax^O  Ax 
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There  are  two  interesting  things  to  note  here:  As  in  the  case  of  the  sine  function  we  are  left 
with  a  hmit  that  involves  Ax  but  not  x,  which  means  that  whatever   lim  (a^^  —  1)/Aa; 

is,  we  know  that  it  is  a  number,  that  is,  a  constant.  This  means  that  has  a  remarkable 
property:  its  derivative  is  a  constant  times  itself. 

We  earlier  remarked  that  the  hardest  limit  we  would  compute  is  lim  sin  a;/a;  =  1;  we 

now  have  a  limit  that  is  just  a  bit  too  hard  to  include  here.  In  fact  the  hard  part  is  to 
see  that  lim  (a^^  —  1)/Aa;  even  exists — does  this  fraction  really  get  closer  and  closer  to 

Aa;->-0 

some  fixed  value?  Yes  it  does,  but  we  will  not  prove  this  fact. 

We  can  look  at  some  examples.  Consider  (2^  —  l)/x  for  some  small  values  of  a;:  1, 
0.828427124,  0.756828460,  0.724061864,  0.70838051,  0.70070877  when  x  is  1,  1/2,  1/4, 
1/8,  1/16,  1/32,  respectively.  It  looks  like  this  is  settling  in  around  0.7,  which  turns  out 
to  be  true  (but  the  limit  is  not  exactly  0.7).  Consider  next  (3^  -  \)/x:  2,  1.464101616, 
1.264296052,  1.177621520,  1.13720773,  1.11768854,  at  the  same  values  of  x.  It  turns  out 
to  be  true  that  in  the  limit  this  is  about  1.1.  Two  examples  don't  establish  a  pattern,  but 
if  you  do  more  examples  you  will  find  that  the  limit  varies  directly  with  the  value  of  a: 
bigger  a,  bigger  limit;  smaller  a,  smaller  limit.  As  we  can  already  see,  some  of  these  limits 
will  be  less  than  1  and  some  larger  than  1.  Somewhere  between  a  =  2  and  a  =  3  the  limit 
will  be  exactly  1;  the  value  at  which  this  happens  is  called  e,  so  that 

lim  — T  =  1. 

Ax^Q  lAX 

As  you  might  guess  from  our  two  examples,  e  is  closer  to  3  than  to  2,  and  in  fact  e  f=:i  2.718. 
Now  we  see  that  the  function      has  a  truly  remarkable  property: 

d    ^         ,  gX+Ax_gX 

-e  =  lim 


dx        Ax^o  Ax 
=  lim 


Ax^o  Aa; 

gAK  _  I 

fim 
=  lim 


Ax^O  Ax 

e^^  -1 


Ax^O  Ax 

That  is,  is  its  own  derivative,  or  in  other  words  the  slope  of  is  the  same  as  its  height, 
or  the  same  as  its  second  coordinate:  The  function  f{x)  =  goes  through  the  point 
(z,  e^)  and  has  slope  there,  no  matter  what  z  is.  It  is  sometimes  convenient  to  express 
the  function  without  an  exponent,  since  complicated  exponents  can  be  hard  to  read.  In 
such  cases  we  use  exp(a;),  e.g.,  exp(l  +  x"^)  instead  of  e"^"*"^  . 
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What  about  the  logarithm  function?  This  too  is  hard,  but  as  the  cosine  function  was 
easier  to  do  once  the  sine  was  done,  so  the  logarithm  is  easier  to  do  now  that  we  know 
the  derivative  of  the  exponential  function.  Let's  start  with  logg  x,  which  as  you  probably 
know  is  often  abbreviated  In  a;  and  called  the  "natural  logarithm"  function. 

Consider  the  relationship  between  the  two  functions,  namely,  that  they  are  inverses, 
that  one  "undoes"  the  other.  Graphically  this  means  that  they  have  the  same  graph  except 
that  one  is  "flipped"  or  "reflected"  through  the  line  y  =  x,  as  shown  in  figure  4.7.1. 


Figure  4.7.1     The  exponential  and  logarithm  functions. 

This  means  that  the  slopes  of  these  two  functions  are  closely  related  as  well:  For  example, 
the  slope  of  is  e  at  a;  =  1;  at  the  corresponding  point  on  the  ln(a;)  curve,  the  slope  must 
be  1/e,  because  the  "rise"  and  the  "run"  have  been  interchanged.  Since  the  slope  of  is 
e  at  the  point  (1,  e),  the  slope  of  ln(a;)  is  1/e  at  the  point  (e,  1). 


Figure  4.7.2     Slope  of  the  exponential  and  logarithm  functions. 

More  generally,  we  know  that  the  slope  of  is  at  the  point  (z,  e^),  so  the  slope  of 
ln(a;)  is  1/e^  at  (e^,  z),  as  indicated  in  figure  4.7.2.  In  other  words,  the  slope  of  In  a;  is  the 
reciprocal  of  the  first  coordinate  at  any  point;  this  means  that  the  slope  of  Inx  at  (x,  Inx) 
is  1/x.  The  upshot  is: 

d  ,  1 


4.7    Derivatives  of  the  exponential  and  logarithmic  functions  87 


We  have  discussed  this  from  the  point  of  view  of  the  graphs,  which  is  easy  to  understand 
but  is  not  normally  considered  a  rigorous  proof — it  is  too  easy  to  be  led  astray  by  pictures 
that  seem  reasonable  but  that  miss  some  hard  point.  It  is  possible  to  do  this  derivation 
without  resorting  to  pictures,  and  indeed  we  will  see  an  alternate  approach  soon. 

Note  that  In  a;  is  defined  only  for  a;  >  0.  It  is  sometimes  useful  to  consider  the  function 
In  |a;|,  a  function  defined  for  x  ^  0.  When  a;  <  0,  In  |a;|  =  ln(— and 

-^Inlxl  =  -^ln(—x)  =  — (— 1)  =  — • 
ax  ax  —X  X 

Thus  whether  x  is  positive  or  negative,  the  derivative  is  the  same. 

What  about  the  functions  and  log„  x?  We  know  that  the  derivative  of  is  some 
constant  times  itself,  but  what  constant?  Remember  that  "the  logarithm  is  the  expo- 
nent" and  you  will  see  that  a  =  e'"".  Then 

X         (  lna\x    xlna 

a    —  I  o      I    —  o  , 

and  we  can  compute  the  derivative  using  the  chain  rule: 

A^a;  ^  _l(ei"«)^  =  _le^'°«  =  (lna)e^'°'^  =  (Ina)a^. 
ax        ax  ax 

The  constant  is  simply  In  a.  Likewise  we  can  compute  the  derivative  of  the  logarithm 
function  log^  x.  Since 

we  can  take  the  logarithm  base  a  of  both  sides  to  get 

loga(a^)  =  log^le^""^)  =  lna;log„  e. 

Then 

d  1 
—  log„  x  =  -  log„  e. 
ax  X 

This  is  a  perfectly  good  answer,  but  we  can  improve  it  slightly.  Since 

a  =  6^'"' 
loga(a)  =  log„(e^"")  =  Inalog^e 
1  =  In  a  log(j  e 

p-  =  log„  e, 
ma 
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we  can  replace  log^j  e  to  get 

d  1 

loga  X  = 


dx      "        X  In  a 
You  may  if  you  wish  memorize  the  formulas 

-^a^  =  (In  a)a"^    and        loga^  =      — • 

dx  dx  xma 

Because  the  "trick"  a  =  e^""  is  often  useful,  and  sometimes  essential,  it  may  be  better  to 
remember  the  trick,  not  the  formula. 

EXAMPLE  4.7.1     Compute  the  derivative  of  f{x)  =  T. 

dx  dx 
dx 

^x\n2]  e-i'^^ 
dx  J 

a;  In  2    r^x 


=  (ln2)e^'^^  =  2^1n2 


EXAMPLE  4.7.2    Compute  the  derivative  of  f{x)  =  2^'  =  2(^^). 

dx  dx 

dx  J 
=  (21n2)a;e^''°2 

=  (21n2)a;2^' 


□ 


□ 


EXAMPLE  4.7.3  Compute  the  derivative  of  f{x)  =  x^.  At  first  this  appears  to  be 
a  new  kind  of  function:  it  is  not  a  constant  power  of  x,  and  it  does  not  seem  to  be  an 
exponential  function,  since  the  base  is  not  constant.  But  in  fact  it  is  no  harder  than  the 
previous  example. 

d    ^  d 


dx  dx 


dx 


=  (x — h  lna;)a;^ 

X 


(l+lna;)a;^  ^ 
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EXAMPLE  4.7.4  Recall  that  we  have  not  justified  the  power  rule  except  when  the 
exponent  is  a  positive  or  negative  integer.  We  can  use  the  exponential  function  to  take 
care  of  other  exponents. 


dx  dx 


rlnx  )  e"^"^ 

dx 


(r  — 

X 

r-l 

rx 


□ 


Exercises  4. 7. 


In  1- 

-19,  find  the  derivat 

ives  of  the  functions. 

1. 

2. 

smx 
  ^ 

3. 

(e  ) 

4. 

sin(e")  =^ 

5. 

sin  X 

e  =^ 

6. 

sin  X 
X  =^ 

7. 

3  X 

X  e  =^ 

8. 

x  +  2"^  ^ 

9. 

(1/3)"'  ^ 

10. 

4x  1  . 

e  /x 

11. 

ln(x^  +  3x)  =^ 

12. 

ln(cos(2;)) 

13. 

\/ln(x2)/x  =^ 

14. 

ln(sec(x)  +  tan(x)) 

15. 

16. 

xlnx 

17. 

ln(ln(3a;)) 

18. 

1  +  ln(3x2) 
1  +  ln(4a;) 

19. 

27x6  (4a;  -  6)8 

20. 

Find  the  value  of  a 
Sketch  the  resulting 

so  that  the  tangent  line  to  y 
;  situation.  =^> 

=  ln(x)  at  X  =  a  is  a 

21. 

If  f{x)  =  ln{x^  +  2] 

1  compute  /'(e^^^). 

22. 

If  y  =  log„  X  then  a 

^  =  X.  Use  implicit  differentiation  to  find  y' . 

4.8    Implicit  Differentiation 

As  we  have  seen,  there  is  a  close  relationship  between  the  derivatives  of  and  In  x  because 
these  functions  are  inverses.  Rather  than  relying  on  pictures  for  our  understanding,  we 
would  like  to  be  able  to  exploit  this  relationship  computationally.  In  fact  this  technique 
can  help  us  find  derivatives  in  many  situations,  not  just  when  we  seek  the  derivative  of  an 
inverse  function. 
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We  will  begin  by  illustrating  the  technique  to  find  what  we  already  know,  the  derivative 
of  In  a;.  Let's  write  y  =  \nx  and  then  x  =  e'°^  =  e^,  that  is,  x  =  e^.  We  say  that 
this  equation  defines  the  function  y  =  \nx  implicitly  because  while  it  is  not  an  explicit 
expression  y  =  . . .,  it  is  true  that  if  x  —  then  y  is  in  fact  the  natural  logarithm  function. 
Now,  for  the  time  being,  pretend  that  all  we  know  of  y  is  that  x  =  e^;  what  can  we  say 
about  derivatives?  We  can  take  the  derivative  of  both  sides  of  the  equation: 


dx  dx 


Then  using  the  chain  rule  on  the  right  hand  side: 


Then  we  can  solve  for  y': 


dx' 


'=1  =  1 

ev  X 


There  is  one  little  difficulty  here.  To  use  the  chain  rule  to  compute  d/dx{e^)  =  y'e^  we 
need  to  know  that  the  function  y  has  a  derivative.  All  we  have  shown  is  that  if  it  has  a 
derivative  then  that  derivative  must  be  1/x.  When  using  this  method  we  will  always  have 
to  assume  that  the  desired  derivative  exists,  but  fortunately  this  is  a  safe  assumption  for 
most  such  problems. 

The  example  y  =  \nx  involved  an  inverse  function  defined  implicitly,  but  other  func- 
tions can  be  defined  implicitly,  and  sometimes  a  single  equation  can  be  used  to  implicitly 
define  more  than  one  function.  Here's  a  familiar  example.  The  equation  r'^  =  x^  +  y^ 
describes  a  circle  of  radius  r.  The  circle  is  not  a  function  y  =  f{x)  because  for  some  values 
of  X  there  are  two  corresponding  values  of  y.  If  we  want  to  work  with  a  function,  we  can 
break  the  circle  into  two  pieces,  the  upper  and  lower  semicircles,  each  of  which  is  a  function. 
Let's  call  these  y  =  U{x)  and  y  =  L(a;);  in  fact  this  is  a  fairly  simple  example,  and  it's 
possible  to  give  explicit  expressions  for  these:  U{x)  =  —  x'^  and  L{x)  =  —s/ r'^  —  x'^  . 
But  it's  somewhat  easier,  and  quite  useful,  to  view  both  functions  as  given  implicitly  by 
=  +  y^:  both  =  +  U{x)'^  and  r'^  —  x'^  +  L{x)'^  are  true,  and  we  can  think  of 
=      +  y^  as  defining  both  U{x)  and  L{x). 

Now  we  can  take  the  derivative  of  both  sides  as  before,  remembering  that  y  is  not 
simply  a  variable  but  a  function — in  this  case,  y  is  either  U{x)  or  L{x)  but  we're  not  yet 
specifying  which  one.  When  we  take  the  derivative  we  just  have  to  remember  to  apply  the 
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chain  rule  where  y  appears. 

—r  =  —{x  +y 
ax  ax 

0  =  2a;  +  2yy' 

1  —2x  X 

y  =  -TT-  =  — 
2y  y 

Now  we  have  an  expression  for  y',  but  it  contains  y  as  well  as  x.  This  means  that  if  we 
want  to  compute  y'  for  some  particular  value  of  x  we'll  have  to  know  or  compute  y  at  that 
value  of  X  as  well.  It  is  at  this  point  that  we  will  need  to  know  whether  y  is  U{x)  or  L{x). 
Occasionally  it  will  turn  out  that  we  can  avoid  explicit  use  of  U{x)  or  L{x)  by  the  nature 
of  the  problem 

EXAMPLE  4.8.1  Find  the  slope  of  the  circle  4  =  +  at  the  point  (1,  - ^3).  Since 
we  know  both  the  x  and  y  coordinates  of  the  point  of  interest,  we  do  not  need  to  explicitly 
recognize  that  this  point  is  on  L{x),  and  we  do  not  need  to  use  L{x)  to  compute  y — but 
we  could.  Using  the  calculation  of  y'  from  above, 

,  _    X  _       1     _  1 

^  ~~y~~W^~W 

It  is  instructive  to  compare  this  approach  to  others. 

We  might  have  recognized  at  the  start  that  (1,  —VS)  is  on  the  function  y  =  L{x)  = 
—  yj 4  —  x'^.  We  could  then  take  the  derivative  of  L{x\  using  the  power  rule  and  the  chain 
rule,  to  get 

L'(a;)  =  -J(4-a;2)-V2(_2a;)  ^ 


2'         '       ^       '  ^ 


X 


2 


Then  we  could  compute  -^^'(l)  =  I/'n/S  by  substituting  x  =\. 

Alternately,  we  could  realize  that  the  point  is  on  L{x\  but  use  the  fact  that  y'  =  —x/y. 
Since  the  point  is  on  L{x)  we  can  replace  y  by  L{x)  to  get 


y  = 


L{x)  ^ 


X 


2' 


without  computing  the  derivative  of  L{x)  explicitly.  Then  we  substitute  x  =  \  and  get  the 
same  answer  as  before.  □ 

In  the  case  of  the  circle  it  is  possible  to  find  the  functions  XJ [x)  and  L{x)  explicitly,  but 
there  are  potential  advantages  to  using  implicit  diff'erentiation  anyway.  In  some  cases  it  is 
more  difficult  or  impossible  to  find  an  explicit  formula  for  y  and  implicit  differentiation  is 
the  only  way  to  find  the  derivative. 
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EXAMPLE  4.8.2  Find  the  derivative  of  any  function  defined  implicitly  by  ya;^  +  e^  =  x. 
We  treat  y  as  an  unspecified  function  and  use  the  chain  rule: 

iiy-'  +  ey)  =  lx 

{y-2x  +  y'-  x'^)  +  y'e^  =  1 

y'x'^  +  y'e^  =  1  —  2xy 

y' {x"^  +  e^)  =  1  -  2xy 

,  ^  1  -  2xy 

^  ~  x^  +  ey  □ 

You  might  think  that  the  step  in  which  we  solve  for  y'  could  sometimes  be  difficult — 
after  all,  we're  using  implicit  differentiation  here  because  we  can't  solve  the  equation 
yx^  +  =  X  for  y,  so  maybe  after  taking  the  derivative  we  get  something  that  is  hard  to 
solve  for  y'.  In  fact,  this  never  happens.  All  occurrences  y'  come  from  applying  the  chain 
rule,  and  whenever  the  chain  rule  is  used  it  deposits  a  single  y'  multiplied  by  some  other 
expression.  So  it  will  always  be  possible  to  group  the  terms  containing  y'  together  and 
factor  out  the  y',  just  as  in  the  previous  example.  If  you  ever  get  anything  more  difficult 
you  have  made  a  mistake  and  should  fix  it  before  trying  to  continue. 

It  is  sometimes  the  case  that  a  situation  leads  naturally  to  an  equation  that  defines  a 
function  implicitly. 


EXAMPLE  4.8.3  Consider  all  the  points  (x,  y)  that  have  the  property  that  the  distance 
from  {x,y)  to  {xi,yi)  plus  the  distance  from  {x,y)  to  {x2,y2)  is  2a  (a  is  some  constant). 
These  points  form  an  ellipse,  which  like  a  circle  is  not  a  function  but  can  viewed  as  two 
functions  pasted  together.  Because  we  know  how  to  write  down  the  distance  between  two 
points,  we  can  write  down  an  implicit  equation  for  the  ellipse: 

y/{x-xi)^  +  {y-yiY  +  \/{x-  X2Y  +  {y-  y2Y  =  2a. 

Then  we  can  use  implicit  differentiation  to  find  the  slope  of  the  ellipse  at  any  point,  though 
the  computation  is  rather  messy.  □ 

EXAMPLE  4.8.4  We  have  already  justified  the  power  rule  by  using  the  exponential 
function,  but  we  could  also  do  it  for  rational  exponents  by  using  implicit  differentiation. 
Suppose  that  y  =  x'^^'^,  where  m  and  n  are  positive  integers.  We  can  write  this  implicitly 
as      =  x"^,  then  because  we  justified  the  power  rule  for  integers,  we  can  take  the  derivative 
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of  each  side: 


y 


y 


m  x^ 
n  y'^~^ 
m  x'^~^ 


y'  =  !^!^m-l-(m/n)(n-l) 

n 

y'  =  '['l^rn-l-m+im/n) 

n 

y'  =  !^a:("^/^)-i 
n 

□ 

Exercises  4. 5. 

In  exercises  1-8,  find  a  formula  for  the  derivative  y'  at  the  point  (x,  y): 

1.  =  l  +  x^  ^ 

2.  +  xy  +      =  7  =^> 

3 3    I  2  3    I  2  , 

.  X  +  xy  =  y  +  yx  =^> 

4.  4  cos  X  sin  y  =  1 

5.  +  ^  =  9  ^ 

6.  tan(x/y)  =  x  +  y  =^ 

7.  sin(x  +  y)  =  xy  =^ 

8.  i  +  i=7^ 

X  y 

9.  A  hyperbola  passing  through  (8,  6)  consists  of  all  points  whose  distance  from  the  origin  is  a 
constant  more  than  its  distance  from  the  point  (5,2).  Find  the  slope  of  the  tangent  line  to 
the  hyperbola  at  (8,6).  =^> 

10.  Compute  y'  for  the  ellipse  of  example  4.8.3. 

11.  The  graph  of  the  equation  x'^  —  xy  +  y^  =  9  is  an  ellipse.  Find  the  lines  tangent  to  this  curve 
at  the  two  points  where  it  intersects  the  x-axis.  Show  that  these  lines  are  parallel.  =^> 

12.  Repeat  the  previous  problem  for  the  points  at  which  the  ellipse  intersects  the  y-axis. 

13.  Find  the  points  on  the  ellipse  from  the  previous  two  problems  where  the  slope  is  horizontal 
and  where  it  is  vertical.  =^ 

14.  Find  an  equation  for  the  tangent  line  to  x^  =  y'^  +x^  at  (2,  \/l2).  (This  curve  is  the  kampyle 
of  Eudoxus.)  =^ 

15.  Find  an  equation  for  the  tangent  line  to  x^'^^  +  y'^^^  =  a'^^  at  a  point  (xi,  yi)  on  the  curve, 
with  xi  7^  0  and  yi  7^  0.  (This  curve  is  an  astroid.)  =^ 

16.  Find  an  equation  for  the  tangent  line  to  [x?  +y'^)^  =  x^  —y^  at  a  point  (xi, yi)  on  the  curve, 
with  xi  7^  0,  —1, 1.  (This  curve  is  a  lemniscate.) 
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Definition.  Two  curves  are  orthogonal  if  at  each  point  of  intersection,  the  angle  between 
their  tangent  Hnes  is  7r/2.  Two  famiUes  of  curves,  A  and  B,  are  orthogonal  trajectories  of  each 
other  if  given  any  curve  C  in  ^  and  any  curve  D  in  B  the  curves  C  and  D  are  orthogonal.  For 
example,  the  family  of  horizontal  lines  in  the  plane  is  orthogonal  to  the  family  of  vertical  lines  in 
the  plane. 

17.  Show  that  —  =  5  is  orthogonal  to  4:X^  +  9y^  =  72.  (Hint:  You  need  to  find  the 
intersection  points  of  the  two  curves  and  then  show  that  the  product  of  the  derivatives  at 
each  intersection  point  is  —1.) 

18.  Show  that  =  is  orthogonal  to  y  =  mx.  Conclude  that  the  family  of  circles  centered 
at  the  origin  is  an  orthogonal  trajectory  of  the  family  of  lines  that  pass  through  the  origin. 

Note  that  there  is  a  technical  issue  when  m  =  0.  The  circles  fail  to  be  differentiable  when  they 
cross  the  a;-axis.  However,  the  circles  are  orthogonal  to  the  a;-axis.  Explain  why.  Likewise, 
the  vertical  line  through  the  origin  requires  a  separate  argument. 

19.  For  k  ^  0  and  c  7^  0  show  that  —  x^  =  k  is  orthogonal  to  yx  =  c.  In  the  case  where  k  and 
c  are  both  zero,  the  curves  intersect  at  the  origin.  Are  the  curves  y^  —  x^  =  0  and  yx  =  0 
orthogonal  to  each  other? 

20.  Suppose  that  m  7^  0.  Show  that  the  family  of  curves  {y  =  mx  +  &  |  6  G  M}  is  orthogonal  to 
the  family  of  curves  {y  =  —{x/m)  +  c  |  c  G  M}. 

4.9    Inverse  Trigonometric  Functions 

The  trigonometric  functions  frequently  arise  in  problems,  and  often  it  is  necessary  to 
invert  the  functions,  for  example,  to  find  an  angle  with  a  specified  sine.  Of  course,  there 
are  many  angles  with  the  same  sine,  so  the  sine  function  doesn't  actually  have  an  inverse 
that  reliably  "undoes"  the  sine  function.  If  you  know  that  sinx  =  0.5,  you  can't  reverse 
this  to  discover  x,  that  is,  you  can't  solve  for  x,  as  there  are  infinitely  many  angles  with 
sine  0.5.  Nevertheless,  it  is  useful  to  have  something  like  an  inverse  to  the  sine,  however 
imperfect.  The  usual  approach  is  to  pick  out  some  collection  of  angles  that  produce  all 
possible  values  of  the  sine  exactly  once.  If  we  "discard"  all  other  angles,  the  resulting 
function  does  have  a  proper  inverse. 

The  sine  takes  on  all  values  between  —1  and  1  exactly  once  on  the  interval  [— 7r/2, 7r/2]. 
If  we  truncate  the  sine,  keeping  only  the  interval  [— 7r/2, 7r/2],  as  shown  in  figure  4.9.1,  then 
this  truncated  sine  has  an  inverse  function.  We  call  this  the  inverse  sine  or  the  arcsine, 
and  write  y  =  arcsin(x). 

Recall  that  a  function  and  its  inverse  undo  each  other  in  either  order,  for  example, 
(v^)'^  =  X  and  =  x.  This  does  not  work  with  the  sine  and  the  "inverse  sine"  because 
the  inverse  sine  is  the  inverse  of  the  truncated  sine  function,  not  the  real  sine  function. 
It  is  true  that  sin(arcsin(x))  =  x,  that  is,  the  sine  undoes  the  arcsine.  It  is  not  true  that 
the  arcsine  undoes  the  sine,  for  example,  sin(57r/6)  =  1/2  and  arcsin(l/2)  =  7r/6,  so  doing 
first  the  sine  then  the  arcsine  does  not  get  us  back  where  we  started.  This  is  because  57r/6 
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^  \  ^ 

271-         -37r/2  -7 
1  - 

f\        -7r/2  y 
\   -1 

7r/2 

7r/2  - 

TT^^      37r/2  /27I- 

-7I-/2  y/ 
  -1 

7r/2                                  -1  / 
L7r/2  - 

1 

Figure  4.9.1     The  sine,  the  truncated  sine,  the  inverse  sine. 

is  not  in  the  domain  of  the  truncated  sine.  If  we  start  with  an  angle  between  — 7r/2  and 
7r/2  then  the  arcsine  does  reverse  the  sine:  sin(7r/6)  =  1/2  and  arcsin(l/2)  =  7r/6. 

What  is  the  derivative  of  the  arcsine?  Since  this  is  an  inverse  function,  we  can  discover 
the  derivative  by  using  imphcit  differentiation.  Suppose  y  =  arcsin(a:).  Then 

sin(y)  =  sin(arcsin(x))  =  x. 

Now  taking  the  derivative  of  both  sides,  we  get 

y'  cos  y  =  \ 


As  we  expect  when  using  imphcit  differentiation,  y  appears  on  the  right  hand  side  here. 
We  would  certainly  prefer  to  have  y'  written  in  terms  of  x,  and  as  in  the  case  of  Inx  we 
can  actually  do  that  here.  Since  sin^  y  +  cos^  y  =  1,  cos^  y  =  1  —  s'm^  y  =  1  —  .  So 
cosy  =  ±a/i  —  x^  ,  but  which  is  it — plus  or  minus?  It  could  in  general  be  either,  but  this 
isn't  "in  general":  since  y  =  arcsin(a;)  we  know  that  —tx/2  <  y  <  n/2,  and  the  cosine  of 
an  angle  in  this  interval  is  always  positive.  Thus  cos  y  =  \/l  —  x'^  and 

—  arcsm(x)  =  =. 
dx  ^  ^  Vl-x2 

Note  that  this  agrees  with  figure  4.9.1:  the  graph  of  the  arcsine  has  positive  slope  every- 
where. 

We  can  do  something  similar  for  the  cosine.  As  with  the  sine,  we  must  first  truncate 
the  cosine  so  that  it  can  be  inverted,  as  shown  in  figure  4.9.2.    Then  we  use  implicit 
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Figure  4.9.2     The  truncated  cosine,  the  inverse  cosine. 


differentiation  to  find  tfiat 


dx 


arccos(a;) 


-1 


Note  tfiat  tfie  truncated  cosine  uses  a  different  interval  than  the  truncated  sine,  so  that  if 
y  =  arccos(x)  we  know  that  0  <  y  <  tt.  The  computation  of  the  derivative  of  the  arccosine 
is  left  as  an  exercise. 

Finally  we  look  at  the  tangent;  the  other  trigonometric  functions  also  have  "partial 
inverses"  but  the  sine,  cosine  and  tangent  are  enough  for  most  purposes.  The  tangent, 
truncated  tangent  and  inverse  tangent  are  shown  in  figure  4.9.3;  the  derivative  of  the 
arctangent  is  left  as  an  exercise. 


tt/S 


./2 


Figure  4.9.3     The  tangent,  the  truncated  tangent,  the  inverse  tangent. 


Exercises  4-9. 

1.  Show  that  the  derivative  of  arccosx  is 

2.  Show  that  the  derivative  of  arctana;  is 


1 

1  +  a;2' 
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3.  The  inverse  of  cot  is  usually  defined  so  that  the  range  of  arccot  is  (0,7r).  Sketch  the  graph 
of  y  =  arccot  x.  In  the  process  you  will  make  it  clear  what  the  domain  of  arccot  is.  Find  the 
derivative  of  the  arccotangent.  =^ 

4.  Show  that  arccot  x  +  arctanx  =  tt/2. 

5.  Find  the  derivative  of  arcsin(x^). 

6.  Find  the  derivative  of  arctan(e'^). 

7.  Find  the  derivative  of  arccos(sinx^) 

8.  Find  the  derivative  of  ln((arcsin  x)'^)  =^> 

9.  Find  the  derivative  of  arccose"^  =^ 

10.  Find  the  derivative  of  arcsina;  +  arccosx  =^ 

11.  Find  the  derivative  of  log5(arctan(x^))  =^ 

4.10    Limits  revisited 

We  have  defined  and  used  the  concept  of  limit,  primarily  in  our  development  of  the  deriva- 
tive. Recall  that  Um  /(x)  =  L  is  true  if,  in  a  precise  sense,  f{x)  gets  closer  and  closer  to 

x—^a 

L  as  X  gets  closer  and  closer  to  a.  While  some  limits  are  easy  to  see,  others  take  some 
ingenuity;  in  particular,  the  limits  that  define  derivatives  are  always  difficult  on  their  face, 
since  in 

^.^  f{x  +  Ax)-f{x) 

Ax^O  Ax 

both  the  numerator  and  denominator  approach  zero.  Typically  this  difficulty  can  be  re- 
solved when  /  is  a  "nice"  function  and  we  are  trying  to  compute  a  derivative.  Occasionally 
such  limits  are  interesting  for  other  reasons,  and  the  limit  of  a  fraction  in  which  both  nu- 
merator and  denominator  approach  zero  can  be  difficult  to  analyze.  Now  that  we  have 
the  derivative  available,  there  is  another  technique  that  can  sometimes  be  helpful  in  such 
circumstances. 

Before  we  introduce  the  technique,  we  will  also  expand  our  concept  of  limit.  We  will 
occasionally  want  to  know  what  happens  to  some  quantity  when  a  variable  gets  very  large 
or  "goes  to  infinity" . 

EXAMPLE  4.10.1  What  happens  to  the  function  cos(l/x)  as  x  goes  to  infinity?  It 
seems  clear  that  as  x  gets  larger  and  larger,  1/x  gets  closer  and  closer  to  zero,  so  cos(l/x) 
should  be  getting  closer  and  closer  to  cos(O)  =  1.  □ 

As  with  ordinary  limits,  this  concept  of  "limit  at  infinity"  can  be  made  precise. 
Roughly,  we  want  lim  f{x)  =  L  to  mean  that  we  can  make  f{x)  as  close  as  we  want 

x—^oo 

to  L  by  making  x  large  enough.  Compare  this  definition  to  the  definition  of  limit  in 
section  2.3. 
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DEFINITION  4.10.2   Limit  at  infinity     If  /  is  a  function,  we  say  that  lim  /(x)  =  L 

x—^oo 

if  for  every  e  >  0  there  is  an  >  0  so  that  whenever  x  >  N,  \f{x)  —  L\  <  e.  We  may 
similarly  define    lim   f{x)  =  L.  □ 


We  include  this  definition  for  completeness,  but  we  will  not  explore  it  in  detail.  Suffice 
it  to  say  that  such  limits  behave  in  much  the  same  way  that  ordinary  limits  do;  in  particular 
there  is  a  direct  analog  of  theorem  2.3.6. 

Now  consider  this  limit: 

2  2 
,         X^  —  IT^ 

lim   . 

x^-K  sinx 

As  X  approaches  tt,  both  the  numerator  and  denominator  approach  zero,  so  it  is  not 
obvious  what,  if  anything,  the  quotient  approaches.  We  can  often  compute  such  limits  by 
application  of  the  following  theorem. 

THEOREM  4.10.3    L'Hopital's  Rule      For  "sufficiently  nice"  functions  f{x)  and 

f(x) 

g{x),  if  lim  /(x)  =  0  =  Um  g{x)  or  lim  /(x)  =  ±oo  =  lim  g{x),  and  if  lim  —i—r  exists, 

X— >a  X— >a  X— x— >a  x— >a  g  (x) 

fix)  fix) 

then  lim  — —  =  lim  — — — .  This  remains  true  if  "x      a"  is  replaced  by  "x  — )■  oo"  or 

x^a  g[x)         x^a  g'[x) 

"x^-oo".  ■ 

This  theorem  is  somewhat  difficult  to  prove,  in  part  because  it  incorporates  so  many 
different  possibilities,  so  we  will  not  prove  it  here.  We  also  will  not  need  to  worry  about 
the  precise  definition  of  "sufficiently  nice" ,  as  the  functions  we  encounter  will  be  suitable. 

2  2 
X    —  TT 

EXAMPLE  4.10.4     Compute  hm    in  two  ways. 

x^TT  sinx 

First  we  use  L'Hopital's  Rule:  Since  the  numerator  and  denominator  both  approach 
zero, 

X^  —  TT^  2x 

hm    =  lim  - 


x^-K     smX  a;^.7rcOSx' 


provided  the  latter  exists.  But  in  fact  this  is  an  easy  limit,  since  the  denominator  now 
approaches  —1,  so 

X2  -  TT^  27r 
lim    =    =  — ZTT. 

x^-K    sinx  —1 
We  don't  really  need  L'Hopital's  Rule  to  do  this  limit.  Rewrite  it  as 
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and  note  that 


X  —  TT  X  —  TT  X 

lim    =  lini   z   =  lim 


x^n  sinx      a;^>7r  —  sin(a;  —  tt)     x^o  sinx 
since  x  —  tt  approaches  zero  as  x  approaches  tt.  Now 

X  —  TT  X 

hm  (x  +  tt)—  =  hm  (x  +  n)  lim  —  =  2n(—l)  =  —2it 

x-^-TT  smx      x-^TT  x->-o  smx 

as  before.  □ 

2x'^  —  Sx  +  7 

EXAMPLE  4.10.5     Compute  hm  —  —          in  two  ways. 

x->-oo  x'^  +  47a;  +  1 

As  X  goes  to  infinity  both  the  numerator  and  denominator  go  to  infinity,  so  we  may 
apply  L'Hopital's  Rule: 

2x^-Sx  +  7  Ax -3 

lim   —          =  hm  — . 

x^oo  x"^  +  47a;  +  1     x^oo  2x  +  47 

In  the  second  quotient,  it  is  still  the  case  that  the  numerator  and  denominator  both  go  to 
infinity,  so  we  are  allowed  to  use  L'Hopital's  Rule  again: 

4a; -3       ,  4 
hm  — -  =  hm  -  —  2. 

x—^oo  2x  +  47  a;— >-cxD  2 

So  the  original  limit  is  2  as  well. 

Again,  we  don't  really  need  L'Hopital's  Rule,  and  in  fact  a  more  elementary  approach 
is  easier — we  divide  the  numerator  and  denominator  by  a;^: 

2x^-3x  +  7            2a;2-3a;  +  7;^  2-|  +  ^ 

hm   —          =  hm   —  4-  =  hm 


aT-^'oo  a;2  +  47a;  +  1     x^oo  x"^  +  47a;  +  1  ^     x^oo  i  +  47  _|_  i  • 

x-^  X     '  x^ 

Now  as  X  approaches  infinity,  all  the  quotients  with  some  power  of  x  in  the  denominator 
approach  zero,  leaving  2  in  the  numerator  and  1  in  the  denominator,  so  the  limit  again  is 
2.  □ 

SGC  X  —  1 

EXAMPLE  4.10.6    Compute  lim  . 

x^-o    sin  a; 

Both  the  numerator  and  denominator  approach  zero,  so  applying  L'Hopital's  Rule: 

seca;  — 1           seca;tana;  1-0 
lim  =  hm  =   =  0. 

x^Q    smx        x^o     cos  a;  1  □ 


EXAMPLE  4.10.7    Compute  lim  a;  In  a;. 

x^0+ 
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This  doesn't  appear  to  be  suitable  for  L'Hopital's  Rule,  but  it  also  is  not  "obvious". 
As  X  approaches  zero,  In  a:  goes  to  — oo,  so  the  product  looks  like  (something  very  small)  • 
(something  very  large  and  negative).  But  this  could  be  anything:  it  depends  on  how  small 
and  how  large.  For  example,  consider  (x^)(l/x),  (x)(l/x),  and  (x)(l/x^).  As  x  approaches 
zero,  each  of  these  is  (something  very  small)  ■  (something  very  large),  yet  the  limits  are 
respectively  zero,  1,  and  oo. 

We  can  in  fact  turn  this  into  a  L'Hopital's  Rule  problem: 


xlnx 


\nx  lux 


1/x 


X 


Now  as  X  approaches  zero,  both  the  numerator  and  denominator  approach  infinity  (one 
—oo  and  one  +oo,  but  only  the  size  is  important).  Using  L'Hopital's  Rule: 


lim 


Inx 


lim 


1/x 


x^0+  X    ^        x^0+  —X 


lim  — (— x^) 

x^.0+  X 


lim 


-X 


0. 


One  way  to  interpret  this  is  that  since  lim  xlnx  —  0,  the  x  approaches  zero  much  faster 
than  the  Inx  approaches  — oo.  □ 


Exercises  4-10. 

Compute  the  limits. 

cos  X  —  1 

1.  iim 


x^o  smx 


3.     lim  \/ 

X — l-oo 

Inx 


\/x^ 


5. 
7. 


lim 


lim 


X 


9.  lim 

x^l 


2  -  v^^T2 


11. 


lim 


13.  lim 

x^Q 


a/4 


X 


15.     lim  f  ^  +  ^  )  (VtTT- 1) 
t^o+  \  t  ^t) 

in -If 


17.    lim  — — - 

u^l  [l/u 


3u 


19.  lim 


1  +  5/7^ 
0+  2  +  1/^ 


2.     lim  —  = 

x^oo  X 

Inx 

4.  lim   

x^oo  X 


6. 
8. 


lim 

X — >OG  g 


—  e^^ 
1/t)  -  1 


t^i+  i2  _  2i  +  1 

t  +  5-2/t-l/t^ 


10.  lim 


12.  lim 


>1       - 1 

14.  lim(t+^)((4-t)^/2-8) 


16.  lim 


18.  lim 


0  V2x  +  1  -  1 
2  + (1/x) 


20.  lim 


5  3-  (2/x) 
3  + 


+  X 


0+      2  +  4X-V2 
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3;  +  a;l/2  +  3,1/3  1 


21.     lim   —  -7- —  ^  22.  lim 

x^oo  -|-  x^'^  t^oo  r,  _  /4t+l 

f+2 


r.2 


49.  lim 


+  4x  +  8 


2 


23.    lim   ^  ^  24.      lim      ^  +  ^ 


t— >-oo    /    t  a;— — 00  1  -)-  -v/T" 

V  *-i 

COS  3;  —  1 

25.      hm  - — -T          ^  26.  lim 


x^it/2  (7r/2)  —  X  a;->-0  X 

27.    lim   =^  28.  lim 

23^0      —  X  —  1  X  —  1 

29.    lim  — ^  — ^  ^  30.  lim 


23^0                 X  X->-l    X^    —  1 

sin(2x)  x^/"^  -  1 

31.    lim  — ^  ^  32.    lim    = 

33-^0  ln(x  +  1)  x^i  X 

33.     lim  34.    lim  -^^  —  ^ 

X  —  1  x->-l    X  —  1 

X^'^+X^^''^  X  + 


35.     lim   -775—  36.  lim 

a;^oo     X  +  X        ^  x^oo  2x  +  X  ^ 

37.     lim    ^^-^^  \  ^  38.  lim 


00  1  +  2x^1  a;^oo  ^2x2  +  1 

^„  3x2  +  x  +  2  ^. 

39.    lim    =^  40.    lim    ,    = 

x-^O        X-4  a;^0         +  4  -  2 

41.    lim  =,  42.     lim  ^^±1 

43.    lim         ^   =^  44.     lim  (x  +  5)  '  ' 


>0  ^x  +  l-1  x^oo"-  ^  \2x       x  +  2 

45.     lim  (x  +  5)(^H  ^^)^  46.    lim(x  +  5)(^+  ^ 


0+  \2x  x  +  2y  x^i  \2x  X 

x^-6x-2  x^-6x-2 
47.    lim   =^  48.    lim   ^ 

x^2        X'^  +  4  x^2      x^  —  4x 


x^\^     2x3  _  2 

X 

50.  The  function  /(x)  =  -  has  two  horizontal  asymptotes.  Find  them  and  give  a  rough 

Vx^  +  1 

sketch  of  /  with  its  horizontal  asymptotes. 


4.11    Hyperbolic  Functions 

The  hyperbolic  functions  appear  with  some  frequency  in  appHcations,  and  are  quite  similar 
in  many  respects  to  the  trigonometric  functions.  This  is  a  bit  surprising  given  our  initial 
definitions. 
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DEFINITION  4.11.1     The  hyperbolic  cosine  is  the  function 

G"^   I  ^ 

coshx  =  , 

and  the  hyperbolic  sine  is  the  function 

  g 

sinhx  =   . 

2 

□ 

Notice  that  cosh  is  even  (that  is,  cosh(— x)  =  cosh(a;))  while  sinh  is  odd  (sinh(— x)  = 
—  sinh(x)),  and  coshx  +  sinhx  =  e^.  Also,  for  all  x,  coshx  >  0,  while  sinhx  =  0  if  and 
only  if     —       =  0,  which  is  true  precisely  when  x  =  0. 

LEMMA  4.11.2    The  range  of  coshx  is  [1,  oo). 
Proof.    Let  y  =  coshx.  We  solve  for  x: 

2y  =     +  e"" 
2ye"  =  e^"  +  1 

0  =  e^^  -  2ye^  +  1 


2y  ±  v/4y2  -  4 


From  the  last  equation,  we  see      >  1,  and  since  y  >  0,  it  follows  that  y  >  1. 
Now  suppose  y  >  1,  so  y  ±        _  1  >  0.  Then  x  =  ln{y  ±  v^y^^) 

is  a  real  number, 

and  y  =  coshx,  so  y  is  in  the  range  of  cosh(x).  ■ 

DEFINITION  4.11.3    The  other  hyperbohc  functions  are 

sinhx 


tanhx  = 
cothx  = 
sech  X  = 
cschx 


coshx 
coshx 
sinhx 
1 

coshx 
1 


sinhx 

The  domain  of  coth  and  csch  is  x  7^  0  while  the  domain  of  the  other  hyperbolic  functions 
is  all  real  numbers.  Graphs  are  shown  in  figure  4.11.1  □ 


4.11    Hyperbolic  Functions  103 


-2-10     1  2-2 


2  -2 


2  -2 


-4  -I  -4'-!  -41 

Figure  4.11.1     The  hyperbolic  functions:  cosh,  sinh,  tanh,  sech,  csch,  coth. 

Certainly  the  hyperbolic  functions  do  not  closely  resemble  the  trigonometric  functions 
graphically.  But  they  do  have  analogous  properties,  beginning  with  the  following  identity. 


THEOREM  4.11.4    For  all  x  in  M,  cosh^  x  -  sinh^  x  =  1. 


Proof.     The  proof  is  a  straightforward  computation: 


cosh^  x— sinh^  x 


This  immediately  gives  two  additional  identities: 


1  —  tanh  X  =  sech  x 


and 


coth^  X  —  1  =  csch^  X. 


The  identity  of  the  theorem  also  helps  to  provide  a  geometric  motivation.  Recall  that 


the  graph  of  x^ 


y 


1  is  a  hyperbola  with  asymptotes  x  =  ±y  whose  x-intercepts  are 


±1.  If  {x,y)  is  a  point  on  the  right  half  of  the  hyperbola,  and  if  we  let  x  =  cosht,  then 

y  =  ±a/x^  —  1  =  ±  V cosh^  X  —  1  =  ±sinht.  So  for  some  suitable  t,  cosht  and  sinht  are 
the  coordinates  of  a  typical  point  on  the  hyperbola.  In  fact,  it  turns  out  that  t  is  twice 
the  area  shown  in  the  first  graph  of  figure  4.11.2.  Even  this  is  analogous  to  trigonometry; 
cost  and  sint  are  the  coordinates  of  a  typical  point  on  the  unit  circle,  and  t  is  twice  the 
area  shown  in  the  second  graph  of  figure  4.11.2. 

Given  the  definitions  of  the  hyperbolic  functions,  finding  their  derivatives  is  straight- 
forward. Here  again  we  see  similarities  to  the  trigonometric  functions. 


THEOREM  4.11.5 


Proof.     —  cosh  X 
ax 


coshx  =  sinhx  and       sinhx  =  coshx. 


d  +  e~ 
dx  2 


sinhx,  and  —  sinhx 
dx 


d  —  e  ^ 
dx  2 


+  e" 


coshx. 
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(cosh  t,  sinh  t) 


(cost,  sint) 


Figure  4.11.2     Geometric  definitions  of  sin,  cos,  sinh,  cosh:  t  is  twice  the  shaded  area  in 
each  figure. 

Since  coshx  >  0,  sinhx  is  increasing  and  hence  injective,  so  sinhx  has  an  inverse, 
arcsinhx.  Also,  sinh  a;  >  0  when  a;  >  0,  so  coshx  is  injective  on  [0,  oo)  and  has  a  (partial) 
inverse,  arccoshx.  The  other  hyperbolic  functions  have  inverses  as  well,  though  arcsechx 
is  only  a  partial  inverse.  We  may  compute  the  derivatives  of  these  functions  as  we  have 
other  inverse  functions. 


THEOREM  4.11.6 


d 

dx 


arcsinh  x 


1 


a/TTx^' 


d 


Proof.     Let  y  =  arcsinh  x,  so  sinhy  —  x.   Then  —  sinhy  =  cosh(y)  ■  y'  =  1,  and  so 

UjJu 


coshy 


1  +  sinh^  y  + 


x^ 


The  other  derivatives  are  left  to  the  exercises. 


Exercises 

1.  Show  that  the  range  of  sinh  a;  is  all  real  numbers.    (Hint:   show  that  if  y  =  sinhx  then 
x  =  ln{y+^y^  +  l).) 

2.  Compute  the  following  limits: 

a.  lim  coshx 

b.  lim  sinhx 

c.  lim  tanhx 

X — >QG 

d.  lim  (cosh  a;  —  sinhx) 

X — >OG 

3.  Show  that  the  range  of  tanhx  is  (—1,1).  What  are  the  ranges  of  coth,  sech,  and  csch?  (Use 
the  fact  that  they  are  reciprocal  functions.) 

4.  Prove  that  for  every  x,y  G  M,  sinh(a;  +  y)  =  sinh  a;  cosh  y  +  cosh  a;  sinh  y.  Obtain  a  similar 
identity  for  sinh(x  —  y). 
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5.  Prove  that  for  every  x,  y  G  M,  cosh(x  +  y)  =  cosh  x  cosh  y  +  sinhxsinhy.  Obtain  a  similar 
identity  for  cosh(x  —  y). 

6.  Use  exercises  4  and  5  to  show  that  sinh(2x)  =  2sinhxcosha;  and  cosh(2x)  =  cosh'^  x  +  sinh'^  x 
for  every  x.  Conclude  also  that  (cosh(2a;)  —  l)/2  =  sinh'^  x. 

7.  Show  that  4~(tanha;)  =  sech'^  x.    Compute  the  derivatives  of  the  remaining  hyperbolic 

ax 

functions  as  well. 

8.  What  are  the  domains  of  the  six  inverse  hyperbolic  functions? 

9.  Sketch  the  graphs  of  all  six  inverse  hyperbolic  functions. 


5 

Curve  Sketching 


Whether  we  are  interested  in  a  function  as  a  purely  mathematical  object  or  in  connection 
with  some  application  to  the  real  world,  it  is  often  useful  to  know  what  the  graph  of 
the  function  looks  like.  We  can  obtain  a  good  picture  of  the  graph  using  certain  crucial 
information  provided  by  derivatives  of  the  function  and  certain  limits. 

5.1    Maxima  and  Minima 

A  local  maximum  point  on  a  function  is  a  point  (x,  y)  on  the  graph  of  the  function 
whose  y  coordinate  is  larger  than  all  other  y  coordinates  on  the  graph  at  points  "close 
to"  {x,y).  More  precisely,  (x, /(x))  is  a  local  maximum  if  there  is  an  interval  (a,  6)  with 
a  <  X  <  b  and  /(x)  >  f{z)  for  every  z  in  (a,  6).  Similarly,  (x,  y)  is  a  local  minimum 
point  if  it  has  locally  the  smallest  y  coordinate.  Again  being  more  precise:  (x,  /(x))  is  a 
local  minimum  if  there  is  an  interval  (a,  b)  with  a  <  x  <  b  and  /(x)  <  f{z)  for  every  z  in 
(a,  b).  A  local  extremum  is  either  a  local  minimum  or  a  local  maximum. 

Local  maximum  and  minimum  points  are  quite  distinctive  on  the  graph  of  a  function, 
and  are  therefore  useful  in  understanding  the  shape  of  the  graph.  In  many  applied  problems 
we  want  to  find  the  largest  or  smallest  value  that  a  function  achieves  (for  example,  we  might 
want  to  find  the  minimum  cost  at  which  some  task  can  be  performed)  and  so  identifying 
maximum  and  minimum  points  will  be  useful  for  applied  problems  as  well.  Some  examples 
of  local  maximum  and  minimum  points  are  shown  in  figure  5.1.1. 

If  (x,  /(x))  is  a  point  where  /(x)  reaches  a  local  maximum  or  minimum,  and  if  the 
derivative  of  /  exists  at  x,  then  the  graph  has  a  tangent  line  and  the  tangent  line  must  be 
horizontal.  This  is  important  enough  to  state  as  a  theorem,  though  we  will  not  prove  it. 
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A 


Figure  5.1.1     Some  local  maximum  points  {A)  and  minimum  points  {B). 


THEOREM  5.1.1    Fermat's  Theorem     If  /(x)  has  a  local  extremum  aX  x  =  a  and 


Thus,  the  only  points  at  which  a  function  can  have  a  local  maximum  or  minimum 
are  points  at  which  the  derivative  is  zero,  as  in  the  left  hand  graph  in  figure  5.1.1,  or  the 
derivative  is  undefined,  as  in  the  right  hand  graph.  Any  value  of  x  for  which  f'{x)  is  zero  or 
undefined  is  called  a  critical  value  for  /.  When  looking  for  local  maximum  and  minimum 
points,  you  are  likely  to  make  two  sorts  of  mistakes:  You  may  forget  that  a  maximum  or 
minimum  can  occur  where  the  derivative  does  not  exist,  and  so  forget  to  check  whether 
the  derivative  exists  everywhere.  You  might  also  assume  that  any  place  that  the  derivative 
is  zero  is  a  local  maximum  or  minimum  point,  but  this  is  not  true.  A  portion  of  the  graph 
of  f{x)  =  x^  is  shown  in  figure  5.1.2.  The  derivative  of  /  is  f'{x)  =  ?>x^,  and  /'(O)  =  0, 
but  there  is  neither  a  maximum  nor  minimum  at  (0,  0). 


Figure  5.1.2     No  maximum  or  minimum  even  though  the  derivative  is  zero. 

Since  the  derivative  is  zero  or  undefined  at  both  local  maximum  and  local  minimum 
points,  we  need  a  way  to  determine  which,  if  either,  actually  occurs.  The  most  elementary 
approach,  but  one  that  is  often  tedious  or  difficult,  is  to  test  directly  whether  the  y  coor- 
dinates "near"  the  potential  maximum  or  minimum  are  above  or  below  the  y  coordinate 
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at  the  point  of  interest.  Of  course,  there  are  too  many  points  "near"  the  point  to  test,  but 
a  httle  thought  shows  we  need  only  test  two  provided  we  know  that  /  is  continuous  (recall 
that  this  means  that  the  graph  of  /  has  no  jumps  or  gaps). 

Suppose,  for  example,  that  we  have  identified  three  points  at  which  /'  is  zero  or 
nonexistent:  (a:2, 2/2)5  (a^s,  2/3)5  and  Xi  <  X2  <  ^3  (see  figure  5.1.3).  Suppose  that 

we  compute  the  value  of  /(a)  for  xi  <  a  <  X2,  and  that  /(a)  <  /(X2).  What  can  we  say 
about  the  graph  between  a  and  X2?  Could  there  be  a  point  {b,f{b)),  a  <  b  <  X2  with 
f{b)  >  /(X2)?  No:  if  there  were,  the  graph  would  go  up  from  (a, /(a))  to  (6, /(6))  then 
down  to  (x2,  f{x2))  and  somewhere  in  between  would  have  a  local  maximum  point.  (This 
is  not  obvious;  it  is  a  result  of  the  Extreme  Value  Theorem,  theorem  6.1.2.)  But  at  that 
local  maximum  point  the  derivative  of  /  would  be  zero  or  nonexistent,  yet  we  already 
know  that  the  derivative  is  zero  or  nonexistent  only  at  xi,  X2,  and  0:3.  The  upshot  is  that 
one  computation  tells  us  that  (x2,  /(a^2))  has  the  largest  y  coordinate  of  any  point  on  the 
graph  near  X2  and  to  the  left  of  X2-  We  can  perform  the  same  test  on  the  right.  If  we  find 
that  on  both  sides  of  X2  the  values  are  smaller,  then  there  must  be  a  local  maximum  at 
{x2-,  /  (X2));  if  we  find  that  on  both  sides  of  X2  the  values  are  larger,  then  there  must  be  a 
local  minimum  at  (x2,  /(X2));  if  we  find  one  of  each,  then  there  is  neither  a  local  maximum 
or  minimum  at  X2. 

• 

n  \     \         \  \ 

xi  a     b  X2  X3 

Figure  5.1.3     Testing  for  a  maximum  or  minimum. 

It  is  not  always  easy  to  compute  the  value  of  a  function  at  a  particular  point.  The 
task  is  made  easier  by  the  availability  of  calculators  and  computers,  but  they  have  their 
own  drawbacks — they  do  not  always  allow  us  to  distinguish  between  values  that  are  very 
close  together.  Nevertheless,  because  this  method  is  conceptually  simple  and  sometimes 
easy  to  perform,  you  should  always  consider  it. 

EXAMPLE  5.1.2  Find  all  local  maximum  and  minimum  points  for  the  function  /(x)  = 
x^  —  X.  The  derivative  is  /'(x)  =  3x^  —  1.  This  is  defined  everywhere  and  is  zero  at 
X  =  ±-\/3/3.  Looking  first  at  x  =  we  see  that  /(V3/3)  =  -2^3/9.  Now  we  test 

two  points  on  either  side  of  x  =  v^/3,  making  sure  that  neither  is  farther  away  than 
the  nearest  critical  value;  since  ^3  <  3,  V3/3  <  1  and  we  can  use  x  =  0  and  x  =  1. 
Since  /(O)  =  0  >  —2V3/9  and  /(I)  =  0  >  — 2-\/3/9,  there  must  be  a  local  minimum  at 
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X  =  -\/3/3.  For  X  =  — -\/3/3,  we  see  that  /(— -\/3/3)  =  2-\/3/9.  This  time  we  can  use  a;  =  0 
and  X  =  —1,  and  we  find  that  /(—I)  =  /(O)  =  0  <  2-\/3/9,  so  there  must  be  a  local 
maximum  at  a;  =  —^3/3.  □ 

Of  course  this  example  is  made  very  simple  by  our  choice  of  points  to  test,  namely 
X  =  —1,  0,  1.  We  could  have  used  other  values,  say  —5/4,  1/3,  and  3/4,  but  this  would 
have  made  the  calculations  considerably  more  tedious. 

EXAMPLE  5.1.3  Find  all  local  maximum  and  minimum  points  for  f{x)  =  sin  x  +  cos  x. 
The  derivative  is  f'{x)  =  cosx  —  sinx.  This  is  always  defined  and  is  zero  whenever 
cosx  =  sinx.  Recalling  that  the  cosx  and  sinx  are  the  x  and  y  coordinates  of  points  on  a 
unit  circle,  we  see  that  cosx  =  sinx  when  x  is  7r/4,  7r/4±7r,  n/4±2n,  tt/4±3tc,  etc.  Since 
both  sine  and  cosine  have  a  period  of  27r,  we  need  only  determine  the  status  of  x  =  7r/4 
and  X  =  57r/4.  We  can  use  0  and  7r/2  to  test  the  critical  value  x  =  7r/4.  We  find  that 
/(7r/4)  =  V2,  /(O)  =  1<  ^2  and  f{n/2)  =  1,  so  there  is  a  local  maximum  when  x  =  7r/4 
and  also  when  x  =  7r/4  ±  2n,  n/4:±  An,  etc.  We  can  summarize  this  more  neatly  by  saying 
that  there  are  local  maxima  at  7r/4  ±  2kn  for  every  integer  k. 

We  use  TT  and  27r  to  test  the  critical  value  x  =  57r/4.  The  relevant  values  are  /(57r/4)  = 
—V2,  fin)  =  —1  >  —V2,  f{2n)  =  1  >  —V2,  so  there  is  a  local  minimum  at  x  =  5n/4, 
5n/4±2n,  5n/4±4n,  etc.  More  succinctly,  there  are  local  minima  at  57r/4±2/c7r  for  every 
integer  k.  □ 

Exercises  5.1. 

In  problems  1-12,  find  all  local  maximum  and  minimum  points  (x,  y)  by  the  method  of  this  section. 
1.    y  =      —  X  ^  2.    y  =  2  +  3a;  —  =^ 

3.    y  =  x^  -  9x^  +  24a;  ^  4.    y  =  a;'^  -  2a;^  +  3  ^ 

5.   y  =  2)x'^  —  Ax^  =^  6.   y  =  (x^  —  l)/x 

7.   y  =       —  (1/a;^)  8.   y  =  cos(2x)  —  x  =^> 


X  —  3    X  <  3 


=  III 


11.    /(x)  =  x^  -  98x  +  4 


13.  For  any  real  number  x  there  is  a  unique  integer  n  such  that  n  <  x  <  n  +  1,  and  the  greatest 
integer  function  is  defined  as  [xj  =  n.  Where  are  the  critical  values  of  the  greatest  integer 
function?  Which  are  local  maxima  and  which  are  local  minima? 

14.  Explain  why  the  function  /(x)  =  1/x  has  no  local  maxima  or  minima. 

15.  How  many  critical  points  can  a  quadratic  polynomial  function  have?  =^> 
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16.  Show  that  a  cubic  polynomial  can  have  at  most  two  critical  points.  Give  examples  to  show 
that  a  cubic  polynomial  can  have  zero,  one,  or  two  critical  points. 

17.  Explore  the  family  of  functions  f{x)  =  +  cx  +  1  where  c  is  a  constant.  How  many  and 
what  types  of  local  extremes  are  there?  Your  answer  should  depend  on  the  value  of  c,  that 
is,  different  values  of  c  will  give  different  answers. 

18.  We  generalize  the  preceding  two  questions.  Let  n  be  a  positive  integer  and  let  /  be  a  poly- 
nomial of  degree  n.  How  many  critical  points  can  /  have?  (Hint:  Recall  the  Fundamental 
Theorem  of  Algebra,  which  says  that  a  polynomial  of  degree  n  has  at  most  n  roots.) 

5.2    The  first  derivative  test 

The  method  of  the  previous  section  for  deciding  whether  there  is  a  local  maximum  or 
minimum  at  a  critical  value  is  not  always  convenient.  We  can  instead  use  information 
about  the  derivative  f'{x)  to  decide;  since  we  have  already  had  to  compute  the  derivative 
to  find  the  critical  values,  there  is  often  relatively  little  extra  work  involved  in  this  method. 

How  can  the  derivative  tell  us  whether  there  is  a  maximum,  minimum,  or  neither  at 
a  point?  Suppose  that  /'(a)  =  0.  If  there  is  a  local  maximum  when  x  =  a,  the  function 
must  be  lower  near  x  =  a  than  it  is  right  at  x  =  a.  If  the  derivative  exists  near  x  =  a,  this 
means  f'{x)  >  0  when  x  is  near  a  and  x  <  a,  because  the  function  must  "slope  up"  just 
to  the  left  of  a.  Similarly,  f'{x)  <  0  when  x  is  near  a  and  x  >  a,  because  /  slopes  down 
from  the  local  maximum  as  we  move  to  the  right.  Using  the  same  reasoning,  if  there  is 
a  local  minimum  at  a;  =  a,  the  derivative  of  /  must  be  negative  just  to  the  left  of  a  and 
positive  just  to  the  right.  If  the  derivative  exists  near  a  but  does  not  change  from  positive 
to  negative  or  negative  to  positive,  that  is,  it  is  positive  on  both  sides  or  negative  on  both 
sides,  then  there  is  neither  a  maximum  nor  minimum  when  x  —  a.  See  the  first  graph  in 
figure  5.1.1  and  the  graph  in  figure  5.1.2  for  examples. 

EXAMPLE  5.2.1  Find  all  local  maximum  and  minimum  points  for  /(x)  =  sin  a;  +  cos  x 
using  the  first  derivative  test.  The  derivative  is  f'{x)  =  cosx  — sinx  and  from  example  5.1.3 
the  critical  values  we  need  to  consider  are  7r/4  and  57r/4. 

The  graphs  of  sinx  and  cosx  are  shown  in  figure  5.2.1.  Just  to  the  left  of  7r/4  the 
cosine  is  larger  than  the  sine,  so  /'(x)  is  positive;  just  to  the  right  the  cosine  is  smaller 
than  the  sine,  so  /'(x)  is  negative.  This  means  there  is  a  local  maximum  at  7r/4.  Just  to 
the  left  of  57r/4  the  cosine  is  smaller  than  the  sine,  and  to  the  right  the  cosine  is  larger 
than  the  sine.  This  means  that  the  derivative  /'(x)  is  negative  to  the  left  and  positive  to 
the  right,  so  /  has  a  local  minimum  at  57r/4.  □ 

Exercises  5.2. 

In  1-13,  find  all  critical  points  and  identify  them  as  local  maximum  points,  local  minimum  points, 
or  neither. 
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1. 
3. 
5. 
7. 


X 


X 


9x  +  24a; 


3a;  —  4a; 


3a; 


(1/x^) 


9.    f{x)  =  {5-  x)/{x  +  2) 
11.    f{x)  =  x^/{x  +  l)  =^ 
13.  fix) 


sin^  X 


Figure  5.2.1 


The  sine  and  cosine. 
2.    y  =  2  +  3a;- 


X 


x"  -  2a;"  +  3 

2 


4.  y  = 

6.  y  =  (x"  —  l)/x 

8.  y  =  cos(2x)  —  X  ■ 

10.  /(x)  =  \x^  -  121 

„2 


12.  fix) 


X  sin(l/a;)  x  ^  0 
0  x  =  0 


14.  Find  the  maxima  and  minima  of  fix)  =  secx. 

15.  Let  fiO)  =  cos^(0)  —  2  sin(6').  Find  the  intervals  where  /  is  increasing  and  the  intervals  where 
/  is  decreasing  in  [0,27r].  Use  this  information  to  classify  the  critical  points  of  /  as  either 
local  maximums,  local  minimums,  or  neither.  =^ 


16.  Let  r  >  0.    Find  the  local  maxima  and  minima  of  the  function  fix) 
domain  \—r,  r] . 


on  its 


17.  Let  fix)  =  ax  +  bx  +  c  with  a  7^  0.  Show  that  /  has  exactly  one  critical  point  using  the 
first  derivative  test.  Give  conditions  on  a  and  b  which  guarantee  that  the  critical  point  will 
be  a  maximum.  It  is  possible  to  see  this  without  using  calculus  at  all;  explain. 


5.3    The  second  derivative  test 

The  basis  of  the  first  derivative  test  is  that  if  the  derivative  changes  from  positive  to 
negative  at  a  point  at  which  the  derivative  is  zero  then  there  is  a  local  maximum  at  the 
point,  and  similarly  for  a  local  minimum.  If  /'  changes  from  positive  to  negative  it  is 
decreasing;  this  means  that  the  derivative  of  /',  /",  might  be  negative,  and  if  in  fact  /" 
is  negative  then  /'  is  definitely  decreasing,  so  there  is  a  local  maximum  at  the  point  in 
question.  Note  well  that  /'  might  change  from  positive  to  negative  while  /"  is  zero,  in 
which  case  /"  gives  us  no  information  about  the  critical  value.  Similarly,  if  /'  changes 
from  negative  to  positive  there  is  a  local  minimum  at  the  point,  and  /'  is  increasing.  If 
/"  >  0  at  the  point,  this  tells  us  that  /'  is  increasing,  and  so  there  is  a  local  minimum. 
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EXAMPLE  5.3.1  Consider  again  /(x)  =  sinx  +  cosx,  with  /'(x)  =  cosx  —  sinx  and 
/"(x)  =  —  sinx  —  cosx.  Since  f'ijx/A)  =  —\/2/2  —  \f2j2  =  —\f2  <  0,  we  know  there  is  a 

local  maximum  at  tt/4.  Since  f"{5n/4)  =  V2/2  V2/2  =  \/2  >  0,  there  is  a  local 

minimum  at  57r/4.  □ 

When  it  works,  the  second  derivative  test  is  often  the  easiest  way  to  identify  local  max- 
imum and  minimum  points.  Sometimes  the  test  fails,  and  sometimes  the  second  derivative 
is  quite  difficult  to  evaluate;  in  such  cases  we  must  fall  back  on  one  of  the  previous  tests. 

EXAMPLE  5.3.2  Let  /(x)  =  x^.  The  derivatives  are  /'(x)  =  4x^  and  f"(x)  =  12x^ 
Zero  is  the  only  critical  value,  but  /"(O)  =  0,  so  the  second  derivative  test  tells  us  nothing. 
However,  /(x)  is  positive  everywhere  except  at  zero,  so  clearly  /(x)  has  a  local  minimum 
at  zero.  On  the  other  hand,  /(x)  =  — x^  also  has  zero  as  its  only  critical  value,  and  the 
second  derivative  is  again  zero,  but  — x^  has  a  local  maximum  at  zero.  □ 

Exercises  5.3. 


Find  all  local  maximum  and  minimum  points  by  the  second  derivative  test. 


1. 

y  = 

2  ^ 
X    —  X  ^ 

2. 

y 

=  2  +  3x-  x^  ^ 

3. 

y  = 

x^  -  9x'^  +  24x  =^ 

4. 

y 

=  x^  -2x^  +  3^ 

5. 

y  = 

3X^  -  4:X^  =^ 

6. 

y 

=  {x^  -  l)/x  =^ 

7. 

y  = 

3x'^  -  ^ 

8. 

y 

=  cos(2x)  —  X  =^ 

9. 

y  = 

4a;  +  a/1  -  a;  =^ 

10. 

y 

=  (a;  +  l)/\/5x2  +  35 

11. 

y  = 

5  ^ 
X  —  a;  =^ 

12. 

y 

=  6x  +  sin  3x 

13. 

y  = 

X  +  1/x  =^ 

14. 

y 

=  a;^  +  1/a;  =^ 

15. 

y  = 

16. 

y 

=  tan^  X  =^> 

17. 

y  = 

2            ■  2 

cos  X  —  sm  X  =^ 

18. 

y 

•    3  ^ 

=  sm  X 

5.4    Concavity  and  inflection  points 

We  know  that  the  sign  of  the  derivative  tells  us  whether  a  function  is  increasing  or  decreas- 
ing; for  example,  when  /'(x)  >  0,  /(x)  is  increasing.  The  sign  of  the  second  derivative 
/"(x)  tells  us  whether  /'  is  increasing  or  decreasing;  we  have  seen  that  if  /'  is  zero  and 
increasing  at  a  point  then  there  is  a  local  minimum  at  the  point,  and  if  /'  is  zero  and 
decreasing  at  a  point  then  there  is  a  local  maximum  at  the  point.  Thus,  we  extracted 
information  about  /  from  information  about  /". 

We  can  get  information  from  the  sign  of  /"  even  when  /'  is  not  zero.  Suppose  that 
/"(a)  >  0.  This  means  that  near  x  =  a,  /'  is  increasing.  If  /'(a)  >  0,  this  means  that  / 
slopes  up  and  is  getting  steeper;  if  /'(a)  <  0,  this  means  that  /  slopes  down  and  is  getting 
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a 


a 


Figure  5.4.1     f'io)  >  0:  /'(a)  positive  and  increasing,  /'(a)  negative  and  increasing. 

less  steep.  The  two  situations  are  shown  in  figure  5.4.1.  A  curve  that  is  shaped  hke  this 
is  called  concave  up. 

Now  suppose  that  /"(a)  <  0.  This  means  that  near  x  =  a,  f  is  decreasing.  If 
/'(a)  >  0,  this  means  that  /  slopes  up  and  is  getting  less  steep;  if  /'(a)  <  0,  this  means 
that  /  slopes  down  and  is  getting  steeper.  The  two  situations  are  shown  in  figure  5.4.2.  A 
curve  that  is  shaped  like  this  is  called  concave  down. 


Figure  5.4.2     f"{a)  <  0:  /'(a)  positive  and  decreasing,  f'{a)  negative  and  decreasing. 

If  we  are  trying  to  understand  the  shape  of  the  graph  of  a  function,  knowing  where 
it  is  concave  up  and  concave  down  helps  us  to  get  a  more  accurate  picture.  Of  particular 
interest  are  points  at  which  the  concavity  changes  from  up  to  down  or  down  to  up;  such 
points  are  called  inflection  points.  If  the  concavity  changes  from  up  to  down  at  x  =  a,  /" 
changes  from  positive  to  the  left  of  a  to  negative  to  the  right  of  a,  and  usually  /"(a)  =  0. 
We  can  identify  such  points  by  first  finding  where  f"{x)  is  zero  and  then  checking  to  see 
whether  f"{x)  does  in  fact  go  from  positive  to  negative  or  negative  to  positive  at  these 
points.  Note  that  it  is  possible  that  /"(a)  =  0  but  the  concavity  is  the  same  on  both  sides; 
f{x)=x'^aXx  —  Q  is  an  example. 

EXAMPLE  5.4.1  Describe  the  concavity  of /(x)  =  x^-x.  f'{x)  =  3x^-1,  f"{x)  =  6x. 
Since  /"(O)  =  0,  there  is  potentially  an  inflection  point  at  zero.  Since  /"(x)  >  0  when 
X  >  0  and  /"(x)  <  0  when  x  <  0  the  concavity  does  change  from  down  to  up  at  zero,  and 
the  curve  is  concave  down  for  all  x  <  0  and  concave  up  for  all  x  >  0.  □ 

Note  that  we  need  to  compute  and  analyze  the  second  derivative  to  understand  con- 
cavity, so  we  may  as  well  try  to  use  the  second  derivative  test  for  maxima  and  minima.  If 
for  some  reason  this  fails  we  can  then  try  one  of  the  other  tests. 


a 


a 
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Exercises  5.4- 


Describe  the  concavity  of  the  functions  in  1-18. 


1. 

y  — 

2  ^ 
X    —  X  — ^ 

2. 

7/ 

y 

—  2  -^?,r  —  =^ 

  ^     1     kJJU          JU  t 

3. 

V 

x^  -  9x^  +  24a;  =^ 

4. 

V 

=  X    —  Ix    +  O  =^ 

K 

KJ  • 

y  = 

Q'y^            zIt*^  — S- 

OJj                    '-tJj   7^ 

« 

y 

—  (rp-  —  ^  \  1 T  ^ 

7. 

y  = 

3x'^  -  ^ 

8. 

y 

=  sin  X  +  cos  X  =^ 

9. 

y  = 

4x  +  ^/l  -  X  ^ 

10. 

y 

=     +  l)/\/5x2  +  35 

11. 

y  = 

5  ^ 
X    —  X  ^ 

12. 

y 

=  6x  +  sin  3x  =^ 

13. 

y  = 

X  +  1/x  =^> 

14. 

y 

=      +  l/x  ^ 

15. 

y  = 

16. 

y 

=  tan^  X  =^ 

17. 

y  = 

2            •  2 

cos  X  —  sm  X  =^ 

18. 

y 

•   3  ^ 

=  sm  X 

19.  Identify  the  intervals  on  which  the  graph  of  the  function  f{x)  =  x^  —  Ax^  +  10  is  of  one  of 
these  four  shapes:  concave  up  and  increasing;  concave  up  and  decreasing;  concave  down  and 
increasing;  concave  down  and  decreasing.  =^> 

20.  Describe  the  concavity  oi  y  =  x'^  +  bx^  +  cx  +  d.  You  wiU  need  to  consider  different  cases, 
depending  on  the  values  of  the  coefficients. 

21.  Let  n  be  an  integer  greater  than  or  equal  to  two,  and  suppose  /  is  a  polynomial  of  degree 
n.  How  many  inflection  points  can  /  have?  Hint:  Use  the  second  derivative  test  and  the 
fundamental  theorem  of  algebra. 

5.5    Asymptotes  and  Other  Things  to  Look  For 

A  vertical  asymptote  is  a  place  where  the  function  becomes  infinite,  typically  because  the 
formula  for  the  function  has  a  denominator  that  becomes  zero.  For  example,  the  reciprocal 
function  f{x)  =  1/x  has  a  vertical  asymptote  at  x  =  0,  and  the  function  tan  x  has  a  vertical 
asymptote  at  a;  =  7r/2  (and  also  at  x  =  — 7r/2,  x  =  3n/2,  etc.).  Whenever  the  formula  for  a 
function  contains  a  denominator  it  is  worth  looking  for  a  vertical  asymptote  by  checking  to 
see  if  the  denominator  can  ever  be  zero,  and  then  checking  the  limit  at  such  points.  Note 
that  there  is  not  always  a  vertical  asymptote  where  the  derivative  is  zero:  /(x)  =  (sinx)/x 
has  a  zero  denominator  at  x  =  0,  but  since  lim(sinx)/x  =  1  there  is  no  asymptote  there. 

A  horizontal  asymptote  is  a  horizontal  line  to  which  /(x)  gets  closer  and  closer  as  x 
approaches  oo  (or  as  x  approaches  — oo).  For  example,  the  reciprocal  function  has  the 
X-axis  for  a  horizontal  asymptote.  Horizontal  asymptotes  can  be  identified  by  computing 
the  limits  lim  /(x)  and    lim   /(x).  Since  lim  1/x  =    lim   1/x  =  0,  the  line  y  =  0  (that 

x—^oo  x—^  —  oo  X— >oo  X— >  — oo 

is,  the  X-axis)  is  a  horizontal  asymptote  in  both  directions. 

Some  functions  have  asymptotes  that  are  neither  horizontal  nor  vertical,  but  some 
other  line.  Such  asymptotes  are  somewhat  more  difficult  to  identify  and  we  will  ignore 
them. 
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If  the  domain  of  the  function  does  not  extend  out  to  infinity,  we  should  also  ask 
what  happens  as  x  approaches  the  boundary  of  the  domain.  For  example,  the  function 
y  =  f{x)  =  Ijyjr^  —  x'^  has  domain  —r<x<r,  and  y  becomes  infinite  as  x  approaches 
either  r  or  — r.  In  this  case  we  might  also  identify  this  behavior  because  when  x  =  ±r  the 
denominator  of  the  function  is  zero. 

If  there  are  any  points  where  the  derivative  fails  to  exist  (a  cusp  or  corner),  then  we 
should  take  special  note  of  what  the  function  does  at  such  a  point. 

Finally,  it  is  worthwhile  to  notice  any  symmetry.  A  function  f{x)  that  has  the  same 
value  for  — a:  as  for  x,  i.e.,  f{—x)  =  /(x),  is  called  an  "even  function."  Its  graph  is 
symmetric  with  respect  to  the  y-axis.  Some  examples  of  even  functions  are:  x^  when  n 
is  an  even  number,  cosx,  and  sin^  x.  On  the  other  hand,  a  function  that  satisfies  the 
property  f{—x)  —  —f{x)  is  called  an  "odd  function."  Its  graph  is  symmetric  with  respect 
to  the  origin.  Some  examples  of  odd  functions  are:  x'^  when  n  is  an  odd  number,  sin  x,  and 
tan  X.  Of  course,  most  functions  are  neither  even  nor  odd,  and  do  not  have  any  particular 
symmetry. 

Exercises  5.5. 

Sketch  the  curves.  Identify  clearly  any  interesting  features,  including  local  maximum  and  mini- 
mum points,  inflection  points,  asymptotes,  and  intercepts. 


1. 

y  = 

2. 

y 

—  x^  —  3x^  —  9x 

3. 

y  = 

4. 

x' 

1,2    2          2  2 

+  x  y  =a  y  , 

5. 

y  = 

xe 

6. 

y 

=  (e^  +  e-=^)/2 

7. 

y  = 

e^^  cosx 

8. 

y 

=      —  sin  a; 

9. 

y  = 

X  1 

e  /x 

10. 

y 

=  4a;  +  Vl  —  X 

11. 

y  = 

(x  +  1)/V52;2  +35 

12. 

y 

5 

=  X    —  X 

13. 

y  = 

6x  +  sin  3x 

14. 

y 

=  X  +  1/x 

15. 

y  = 

x"^  +  \/x 

16. 

y 

=  + 

17. 

y  = 

tan^  X 

18. 

y 

2            .  2 

=  COS  X  —  sm  X 

19. 

y  = 

■  3 

sm  X 

20. 

y 

=  x{x^  +  1) 

21. 

y  = 

x^  +       +  9x 

22. 

y 

=  x/{x^  -  9) 

23. 

y  = 

x^/ix''  +  9) 

24. 

y 

=  2^/x  —  X 

25. 

y  = 

3sin(x)  —  sin^(a;),  for  x  E  [0,  27r] 

26. 

y 

=  ix-  l)/(x^) 

For  each  of  the  following  five  functions,  identify  any  vertical  and  horizontal  asymptotes,  and 
identify  intervals  on  which  the  function  is  concave  up  and  increasing;  concave  up  and  decreasing; 
concave  down  and  increasing;  concave  down  and  decreasing. 

27.  f{e)  =  seci9) 

28.  f{x)  =  l/{l  +  x^) 

29.  /(a;)  =  (x  -  3)/(2x  -  2) 
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30.  fix)  =  1/(1  -  x") 

31.  fix)  =  1  +  1/ix^) 

32.  Let  f{x)  =  l/ix^  —  a^),  where  a  >  0.  Find  any  vertical  and  horizontal  asymptotes  and 
the  intervals  upon  which  the  given  function  is  concave  up  and  increasing;  concave  up  and 
decreasing;  concave  down  and  increasing;  concave  down  and  decreasing.  Discuss  how  the 
value  of  a  affects  these  features. 


6 

Applications  of  the  Derivative 


6.1  Optimization 

Many  important  applied  problems  involve  finding  the  best  way  to  accomplish  some  task. 
Often  this  involves  finding  the  maximum  or  minimum  value  of  some  function:  the  minimum 
time  to  make  a  certain  journey,  the  minimum  cost  for  doing  a  task,  the  maximum  power 
that  can  be  generated  by  a  device,  and  so  on.  Many  of  these  problems  can  be  solved  by 
finding  the  appropriate  function  and  then  using  techniques  of  calculus  to  find  the  maximum 
or  the  minimum  value  required. 

Generally  such  a  problem  will  have  the  following  mathematical  form:  Find  the  largest 
(or  smallest)  value  of  f{x)  when  a  <  x  <  b.  Sometimes  a  or  6  are  infinite,  but  frequently 
the  real  world  imposes  some  constraint  on  the  values  that  x  may  have. 

Such  a  problem  differs  in  two  ways  from  the  local  maximum  and  minimum  problems 
we  encountered  when  graphing  functions:  We  are  interested  only  in  the  function  between 
a  and  b,  and  we  want  to  know  the  largest  or  smallest  value  that  f{x)  takes  on,  not  merely 
values  that  are  the  largest  or  smallest  in  a  small  interval.  That  is,  we  seek  not  a  local 
maximum  or  minimum  but  a  global  maximum  or  minimum,  sometimes  also  called  an 
absolute  maximum  or  minimum. 

Any  global  maximum  or  minimum  must  of  course  be  a  local  maximum  or  minimum. 
If  we  find  all  possible  local  extrema,  then  the  global  maximum,  if  it  exists,  must  be  the 
largest  of  the  local  maxima  and  the  global  minimum,  if  it  exists,  must  be  the  smallest  of 
the  local  minima.  We  already  know  where  local  extrema  can  occur:  only  at  those  points 
at  which  f'{x)  is  zero  or  undefined.  Actually,  there  are  two  additional  points  at  which  a 
maximum  or  minimum  can  occur  if  the  endpoints  a  and  b  are  not  infinite,  namely,  at  a 
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-2 


1 


Figure  6.1.1     The  function  f{x)  =      restricted  to  [—2, 1] 


and  h.  We  have  not  previously  considered  such  points  because  we  have  not  been  interested 
in  Umiting  a  function  to  a  smaU  interval.  An  example  should  make  this  clear. 


[—2, 1],  shown  in  figure  6.1.1.  We  compute  f  {x)  =  2x,  which  is  zero  at  a;  =  0  and  is  always 
defined. 

Since  /'(I)  =  2  we  would  not  normally  flag  a;  =  1  as  a  point  of  interest,  but  it  is  clear 
from  the  graph  that  when  f{x)  is  restricted  to  [—2, 1]  there  is  a  local  maximum  at  x  =  1. 
Likewise  we  would  not  normally  pay  attention  to  x  =  —2,  but  since  we  have  truncated  / 
at  —2  we  have  introduced  a  new  local  maximum  there  as  well.  In  a  technical  sense  nothing 
new  is  going  on  here:  When  we  truncate  /  we  actually  create  a  new  function,  let's  call  it 
g,  that  is  defined  only  on  the  interval  [—2, 1].  If  we  try  to  compute  the  derivative  of  this 
new  function  we  actually  find  that  it  does  not  have  a  derivative  at  —2  or  1.  Why?  Because 
to  compute  the  derivative  at  1  we  must  compute  the  limit 


This  limit  does  not  exist  because  when  Ax  >  0,  g{l  +  Ax)  is  not  defined.  It  is  simpler, 
however,  simply  to  remember  that  we  must  always  check  the  endpoints. 

So  the  function  g,  that  is,  /  restricted  to  [—2, 1],  has  one  critical  value  and  two  finite 
endpoints,  any  of  which  might  be  the  global  maximum  or  minimum.  We  could  first  deter- 
mine which  of  these  are  local  maximum  or  minimum  points  (or  neither);  then  the  largest 
local  maximum  must  be  the  global  maximum  and  the  smallest  local  minimum  must  be  the 
global  minimum.  It  is  usually  easier,  however,  to  compute  the  value  of  /  at  every  point 
at  which  the  global  maximum  or  minimum  might  occur;  the  largest  of  these  is  the  global 
maximum,  the  smallest  is  the  global  minimum. 

So  we  compute  /(— 2)  =  4,  /(O)  =  0,  /(I)  =  1.  The  global  maximum  is  4  at  x  =  —2 
and  the  global  minimum  is  0  at  x  =  0.  □ 


EXAMPLE  6.1.1    Find  the  maximum  and  minimum  values  of  /(x)  =  x^  on  the  interval 


lim 

Ax^-O 


g{l  +  Ax)-g{l) 


Ax 
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It  is  possible  that  there  is  no  global  maximum  or  minimum.  It  is  difficult,  and  not 
particularly  useful,  to  express  a  complete  procedure  for  determining  whether  this  is  the 
case.  Generally,  the  best  approach  is  to  gain  enough  understanding  of  the  shape  of  the 
graph  to  decide.  Fortunately,  only  a  rough  idea  of  the  shape  is  usually  needed. 

There  are  some  particularly  nice  cases  that  are  easy.  A  continuous  function  on  a  closed 
interval  [a,  b]  always  has  both  a  global  maximum  and  a  global  minimum,  so  examining  the 
critical  values  and  the  endpoints  is  enough: 

THEOREM  6.1.2  Extreme  value  theorem  If  /  is  continuous  on  a  closed  interval 
[a,  b],  then  it  has  both  a  minimum  and  a  maximum  point.  That  is,  there  are  real  numbers 
c  and  d  in  [a,  b]  so  that  for  every  x  in  [a,  6],  /(x)  <  /(c)  and  /(x)  >  f{d).  ■ 

Another  easy  case:  If  a  function  is  continuous  and  has  a  single  critical  value,  then  if 
there  is  a  local  maximum  at  the  critical  value  it  is  a  global  maximum,  and  if  it  is  a  local 
minimum  it  is  a  global  minimum.  There  may  also  be  a  global  minimum  in  the  first  case, 
or  a  global  maximum  in  the  second  case,  but  that  will  generally  require  more  effort  to 
determine. 

EXAMPLE  6.1.3  Let  /(x)  =  -x^  +  4x  -  3.  Find  the  maximum  value  of  /(x)  on  the 
interval  [0, 4].  First  note  that  /'(x)  =  —2x+4  =  0  when  x  =  2,  and  /(2)  =  1.  Next  observe 
that  /'(x)  is  defined  for  all  x,  so  there  are  no  other  critical  values.  Finally,  /(O)  =  —3  and 
/(4)  =  —3.  The  largest  value  of  /(x)  on  the  interval  [0,4]  is  /(2)  =  1.  □ 

EXAMPLE  6.1.4  Let  /(x)  =  -x^  +  4x  -  3.  Find  the  maximum  value  of  /(x)  on  the 
interval  [—1,1]. 

First  note  that  /'(x)  =  — 2x  +  4  =  0  when  x  =  2.  But  x  =  2  is  not  in  the  interval,  so 
we  don't  use  it.  Thus  the  only  two  points  to  be  checked  are  the  endpoints;  /(—I)  =  —8 
and  /(I)  =  0.  So  the  largest  value  of  /(x)  on  [—1, 1]  is  /(I)  =  0.  □ 

EXAMPLE  6.1.5  Find  the  maximum  and  minimum  values  of  the  function  /(x)  = 
7+  |x  —  2|  for  X  between  1  and  4  inclusive.  The  derivative  /'(x)  is  never  zero,  but  /'(x)  is 
undefined  at  x  =  2,  so  we  compute  /(2)  =  7.  Checking  the  end  points  we  get  /(I)  =  8  and 
/(4)  =  9.  The  smallest  of  these  numbers  is  /(2)  =  7,  which  is,  therefore,  the  minimum 
value  of  /(x)  on  the  interval  1  <  x  <  4,  and  the  maximum  is  /(4)  =  9.  □ 

EXAMPLE  6.1.6  Find  all  local  maxima  and  minima  for  /(x)  =  x"^  —  x,  and  deter- 
mine whether  there  is  a  global  maximum  or  minimum  on  the  open  interval  (—2,2).  In 
example  5.1.2  we  found  a  local  maximum  at  (—a/3/3,  2 a/3/9)  and  a  local  minimum  at 
{V3/3,  — 2a/3/9).  Since  the  endpoints  are  not  in  the  interval  (—2,2)  they  cannot  be  con- 
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Figure  6.1.2     f{x)  =      —  x 


sidered.  Is  the  lone  local  maximum  a  global  maximum?  Here  we  must  look  more  closely  at 
the  graph.  We  know  that  on  the  closed  interval  [— v^/3]  there  is  a  global  maximum 
at  X  =  — "\/3/3  and  a  global  minimum  at  x  =  a/3/3.  So  the  question  becomes:  what  hap- 
pens between  —2  and  —a/3/3,  and  between  v^/3  and  2?  Since  there  is  a  local  minimum 
at  X  =  "\/3/3,  the  graph  must  continue  up  to  the  right,  since  there  are  no  more  critical 
values.  This  means  no  value  of  /  will  be  less  than  —2V3/9  between  v^/3  and  2,  but  it 
says  nothing  about  whether  we  might  find  a  value  larger  than  the  local  maximum  2\/3/9. 
How  can  we  tell?  Since  the  function  increases  to  the  right  of  a/3/3,  we  need  to  know  what 
the  function  values  do  "close  to"  2.  Here  the  easiest  test  is  to  pick  a  number  and  do  a 
computation  to  get  some  idea  of  what's  going  on.  Since  /(1. 9)  =  4.959  >  2v^/9,  there 
is  no  global  maximum  at  —V3/3,  and  hence  no  global  maximum  at  all.  (How  can  we  tell 
that  4.959  >  2v^/9?  We  can  use  a  calculator  to  approximate  the  right  hand  side;  if  it  is 
not  even  close  to  4.959  we  can  take  this  as  decisive.  Since  2a/3/9  ~  0.3849,  there's  really 
no  question.  Funny  things  can  happen  in  the  rounding  done  by  computers  and  calculators, 
however,  so  we  might  be  a  little  more  careful,  especially  if  the  values  come  out  quite  close. 
In  this  case  we  can  convert  the  relation  4.959  >  2^3/9  into  (9/2)4.959  >  \/3  and  ask 
whether  this  is  true.  Since  the  left  side  is  clearly  larger  than  4  ■  4  which  is  clearly  larger 
than  a/3,  this  settles  the  question.) 

A  similar  analysis  shows  that  there  is  also  no  global  minimum.  The  graph  of  /(x)  on 
(—2,  2)  is  shown  in  figure  6.1.2.  □ 

EXAMPLE  6.1.7    Of  all  rectangles  of  area  100,  which  has  the  smallest  perimeter? 

First  we  must  translate  this  into  a  purely  mathematical  problem  in  which  we  want  to 
find  the  minimum  value  of  a  function.  If  x  denotes  one  of  the  sides  of  the  rectangle,  then 
the  adjacent  side  must  be  100/x  (in  order  that  the  area  be  100).  So  the  function  we  want 
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to  minimize  is 

f(x)  =  2x  +  2  — 

X 

since  the  perimeter  is  twice  the  length  plus  twice  the  width  of  the  rectangle.  Not  all  values 
of  X  make  sense  in  this  problem:  lengths  of  sides  of  rectangles  must  be  positive,  so  a;  >  0. 
If  a;  >  0  then  so  is  100/x,  so  we  need  no  second  condition  on  x. 

We  next  find  f'{x)  and  set  it  equal  to  zero:  0  =  f'{x)  =  2  —  200/x'^.  Solving  f'{x)  =  0 
for  X  gives  us  x  =  ±10.  We  are  interested  only  in  x  >  0,  so  only  the  value  x  =  10  is  of 
interest.  Since  f'{x)  is  defined  everywhere  on  the  interval  (0,  oo),  there  are  no  more  critical 
values,  and  there  are  no  endpoints.  Is  there  a  local  maximum,  minimum,  or  neither  at 
X  =  10?  The  second  derivative  is  f"{x)  =  400/a;^,  and  /"(lO)  >  0,  so  there  is  a  local 
minimum.  Since  there  is  only  one  critical  value,  this  is  also  the  global  minimum,  so  the 
rectangle  with  smallest  perimeter  is  the  10  x  10  square.  □ 

EXAMPLE  6.1.8  You  want  to  sell  a  certain  number  n  of  items  in  order  to  maximize 
your  profit.  Market  research  tells  you  that  if  you  set  the  price  at  $1.50,  you  will  be  able 
to  sell  5000  items,  and  for  every  10  cents  you  lower  the  price  below  $1.50  you  will  be  able 
to  sell  another  1000  items.  Suppose  that  your  fixed  costs  ("start-up  costs")  total  $2000, 
and  the  per  item  cost  of  production  ("marginal  cost")  is  $0.50.  Find  the  price  to  set  per 
item  and  the  number  of  items  sold  in  order  to  maximize  profit,  and  also  determine  the 
maximum  profit  you  can  get. 

The  first  step  is  to  convert  the  problem  into  a  function  maximization  problem.  Since 
we  want  to  maximize  profit  by  setting  the  price  per  item,  we  should  look  for  a  function 
P{x)  representing  the  profit  when  the  price  per  item  is  x.  Profit  is  revenue  minus  costs,  and 
revenue  is  number  of  items  sold  times  the  price  per  item,  so  we  get  P  =  nx  —  2000  —  0.50n. 
The  number  of  items  sold  is  itself  a  function  of  n  =  5000  +  1000(1.5  —  a;)/0.10,  because 
(1.5  —  a;)/0.10  is  the  number  of  multiples  of  10  cents  that  the  price  is  below  $1.50.  Now 
we  substitute  for  n  in  the  profit  function: 

P{x)  =  (5000  +  1000(1.5  -  x)/Q.lQ)x  -  2000  -  0.5(5000  +  1000(1.5  -  x)/0.10) 
=  -lOOOOx^  +  25000a;  -  12000 

We  want  to  know  the  maximum  value  of  this  function  when  x  is  between  0  and  1.5.  The 
derivative  is  P'{x)  =  -20000a;  +  25000,  which  is  zero  when  x  =  1.25.  Since  P"{x)  = 
—20000  <  0,  there  must  be  a  local  maximum  at  a;  =  1.25,  and  since  this  is  the  only  critical 
value  it  must  be  a  global  maximum  as  well.  (Alternately,  we  could  compute  P{0)  = 
-12000,  P(1.25)  =  3625,  and  P(1.5)  =  3000  and  note  that  P(1.25)  is  the  maximum  of 
these.)  Thus  the  maximum  profit  is  $3625,  attained  when  we  set  the  price  at  $1.25  and 
sell  7500  items.  □ 
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Figure  6.1.3     Rectangle  in  a  parabola. 

EXAMPLE  6.1.9  Find  the  largest  rectangle  (that  is,  the  rectangle  with  largest  area) 
that  fits  inside  the  graph  of  the  parabola  y  =  below  the  line  y  =  a  (a  is  an  unspecified 
constant  value),  with  the  top  side  of  the  rectangle  on  the  horizontal  line  y  =  a;  see 
figure  6.1.3.) 

We  want  to  find  the  maximum  value  of  some  function  A{x)  representing  area.  Perhaps 
the  hardest  part  of  this  problem  is  deciding  what  x  should  represent.  The  lower  right 
corner  of  the  rectangle  is  at  (x,x^),  and  once  this  is  chosen  the  rectangle  is  completely 
determined.  So  we  can  let  the  x  in  A{x)  be  the  x  of  the  parabola  f{x)  =  x"^ .  Then  the 
area  is  A{x)  =  {2x){a  —  x'^)  =  —2x^  +  2ax.  We  want  the  maximum  value  of  A(x)  when  x  is 
in  [0,  y/a].  (You  might  object  to  allowing  a;  =  0  or  a:  =  a/o,  since  then  the  "rectangle"  has 
either  no  width  or  no  height,  so  is  not  "really"  a  rectangle.  But  the  problem  is  somewhat 
easier  if  we  simply  allow  such  rectangles,  which  have  zero  area.) 

Setting  0  =  A'{x)  =  6x^  +  2a  we  get  x  =  y/a/3  as  the  only  critical  value.  Testing  this 
and  the  two  endpoints,  we  have  A{0)  =  A{y/a)  =  0  and  A{y/a/3)  =  (4/9) VSa^/^.  The 
maximum  area  thus  occurs  when  the  rectangle  has  dimensions  2\/a/3  x  (2/3)a.  □ 

EXAMPLE  6.1.10  If  you  fit  the  largest  possible  cone  inside  a  sphere,  what  fraction 
of  the  volume  of  the  sphere  is  occupied  by  the  cone?  (Here  by  "cone"  we  mean  a  right 
circular  cone,  i.e.,  a  cone  for  which  the  base  is  perpendicular  to  the  axis  of  symmetry,  and 
for  which  the  cross-section  cut  perpendicular  to  the  axis  of  symmetry  at  any  point  is  a 
circle.) 

Let  R  be  the  radius  of  the  sphere,  and  let  r  and  h  be  the  base  radius  and  height  of 
the  cone  inside  the  sphere.  What  we  want  to  maximize  is  the  volume  of  the  cone:  7rr^/i/3. 
Here  i?  is  a  fixed  value,  but  r  and  h  can  vary.  Namely,  we  could  choose  r  to  be  as  large  as 
possible — equal  to  R — by  taking  the  height  equal  to  R;  or  we  could  make  the  cone's  height 
h  larger  at  the  expense  of  making  r  a  little  less  than  R.  See  the  cross-section  depicted  in 
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{h  -  R,r) 

Figure  6.1.4     Cone  in  a  sphere. 


figure  6.1.4.  We  have  situated  the  picture  in  a  convenient  way  relative  to  the  x  and  y  axes, 
namely,  with  the  center  of  the  sphere  at  the  origin  and  the  vertex  of  the  cone  at  the  far 
left  on  the  x-axis. 

Notice  that  the  function  we  want  to  maximize,  7rr^/i/3,  depends  on  two  variables.  This 
is  frequently  the  case,  but  often  the  two  variables  are  related  in  some  way  so  that  "really" 
there  is  only  one  variable.  So  our  next  step  is  to  find  the  relationship  and  use  it  to  solve 
for  one  of  the  variables  in  terms  of  the  other,  so  as  to  have  a  function  of  only  one  variable 
to  maximize.  In  this  problem,  the  condition  is  apparent  in  the  figure:  the  upper  corner  of 
the  triangle,  whose  coordinates  are  [h  —  R,r),  must  be  on  the  circle  of  radius  R.  That  is, 

{h-Rf  +  r''  =  R\ 

We  can  solve  for  h  in  terms  of  r  or  for  r  in  terms  of  h.  Either  involves  taking  a  square 
root,  but  we  notice  that  the  volume  function  contains  r^,  not  r  by  itself,  so  it  is  easiest  to 
solve  for  r   directly:  r  =  R  —  [h  ~  R)  .  Then  we  substitute  the  result  into  nr  h/3: 

V{h)  =  n{R'^  -  {h-Rf)h/3 

=  +  -nh^R 

3  3 

We  want  to  maximize  V{h)  when  h  is  between  0  and  2R.  Now  we  solve  0  =  f'{h)  — 
-TTh^  +  {4/3)7ThR,  getting  h  =  0  or  h  =  4R/3.  We  compute  V{0)  =  V{2R)  =  0  and 
V{4R/3)  =  (32/81)7ri?^  The  maximum  is  the  latter;  since  the  volume  of  the  sphere  is 
(4/3)7r-R'^,  the  fraction  of  the  sphere  occupied  by  the  cone  is 

(32/81)7ri?3       8  ^ 
(4/3)7ri?3  -^-30%. 


□ 
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EXAMPLE  6.1.11  You  are  making  cylindrical  containers  to  contain  a  given  volume. 
Suppose  that  the  top  and  bottom  are  made  of  a  material  that  is  N  times  as  expensive 
(cost  per  unit  area)  as  the  material  used  for  the  lateral  side  of  the  cylinder.  Find  (in  terms 
of  N)  the  ratio  of  height  to  base  radius  of  the  cylinder  that  minimizes  the  cost  of  making 
the  containers. 

Let  us  first  choose  letters  to  represent  various  things:  h  for  the  height,  r  for  the  base 
radius,  V  for  the  volume  of  the  cylinder,  and  c  for  the  cost  per  unit  area  of  the  lateral  side 
of  the  cylinder;  V  and  c  are  constants,  h  and  r  are  variables.  Now  we  can  write  the  cost 
of  materials: 

c{2nrh)  +Nc{27Tr^). 

Again  we  have  two  variables;  the  relationship  is  provided  by  the  fixed  volume  of  the 
cylinder:  V  =  nr'^h.  We  use  this  relationship  to  eliminate  h  (we  could  eliminate  r,  but  it's 
a  little  easier  if  we  eliminate  h,  which  appears  in  only  one  place  in  the  above  formula  for 
cost).  The  result  is 


V 


2cV 


f{r)  =  2c7rr — ^  +  2A"c7rr^  =  h  2A"c7rr^. 


We  want  to  know  the  minimum  value  of  this  function  when  r  is  in  (0,oo).  We  now  set 
0  =  f'{r)  =  -2cV/r^  +  ANcttt,  giving  r  =  ^V/{2Ntt).  Since  f"{r)  =  AcV/r^  +  ANctt 
is  positive  when  r  is  positive,  there  is  a  local  minimum  at  the  critical  value,  and  hence  a 
global  minimum  since  there  is  only  one  critical  value. 
Finally,  since  h  =  V/(7rr^), 


V 


h  V 


r      7rr3  7r(F/(2iV7r)) 


27V, 


so  the  minimum  cost  occurs  when  the  height  h  is  2N  times  the  radius.  If,  for  example, 
there  is  no  difference  in  the  cost  of  materials,  the  height  is  twice  the  radius  (or  the  height 
is  equal  to  the  diameter).  □ 


D  B  C 

Figure  6.1.5     Minimizing  travel  time. 
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EXAMPLE  6.1.12  Suppose  you  want  to  reach  a  point  A  that  is  located  across  the 
sand  from  a  nearby  road  (see  figure  6.1.5).  Suppose  that  the  road  is  straight,  and  b  is  the 
distance  from  A  to  the  closest  point  C  on  the  road.  Let  v  be  your  speed  on  the  road,  and 
let  w,  which  is  less  than  v,  be  your  speed  on  the  sand.  Right  now  you  are  at  the  point 
D,  which  is  a  distance  a  from  C.  At  what  point  B  should  you  turn  off  the  road  and  head 
across  the  sand  in  order  to  minimize  your  travel  time  to  A7 

Let  X  be  the  distance  short  of  C  where  you  turn  off,  i.e.,  the  distance  from  B  to  C.  We 
want  to  minimize  the  total  travel  time.  Recall  that  when  traveling  at  constant  velocity, 
time  is  distance  divided  by  velocity. 

You  travel  the  distance  DB  at  speed  v,  and  then  the  distance  BA  at  speed  w.  Since 
DB  =  a  —  X  and,  by  the  Pythagorean  theorem,  BA  =  \/x^~-\-l^,  the  total  time  for  the 
trip  is 

„,  .      a  — X     \/ x'^  +  b"^ 
/  X  =   H  . 

V  w 

We  want  to  find  the  minimum  value  of  /  when  x  is  between  0  and  a.  As  usual  we  set 
f'{x)  =  0  and  solve  for  x: 

0  =  /'(x)  =  --  + 


V     w\/x^  +  62 
w^/x'^  +  b"^  =  vx 

2/    2    I   t2\         2  2 
W  [X    +  b  )  =  V  X 

2i.2       12  2\  2 

W  b    =  [V    —  W  )X 

wb 


X 


;2 

Notice  that  a  does  not  appear  in  the  last  expression,  but  a  is  not  irrelevant,  since  we 


are  interested  only  in  critical  values  that  are  in  [0,a],  and  wb/yv'^  —  w"^  is  either  in  this 
interval  or  not.  If  it  is,  we  can  use  the  second  derivative  to  test  it: 

u2 


(x2  +  62)3/2^^- 

Since  this  is  always  positive  there  is  a  local  minimum  at  the  critical  point,  and  so  it  is  a 
global  minimum  as  well. 

If  the  critical  value  is  not  in  [0,  a]  it  is  larger  than  a.  In  this  case  the  minimum  must 
occur  at  one  of  the  endpoints.  We  can  compute 

/(0)  =  ^  +  ^ 

V  W 
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but  it  is  difficult  to  determine  which  of  these  is  smaller  by  direct  comparison.  If,  as  is 
likely  in  practice,  we  know  the  values  of  v,  w,  a,  and  b,  then  it  is  easy  to  determine  this. 
With  a  little  cleverness,  however,  we  can  determine  the  minimum  in  general.  We  have  seen 
that  f"{x)  is  always  positive,  so  the  derivative  f'{x)  is  always  increasing.  We  know  that 


at  wb/  y  —  the  derivative  is  zero,  so  for  values  of  x  less  than  that  critical  value,  the 
derivative  is  negative.  This  means  that  /(O)  >  /(a),  so  the  minimum  occurs  when  x  =  a. 


So  the  upshot  is  this:  If  you  start  farther  away  from  C  than  wbj  y     —      then  you 


always  want  to  cut  across  the  sand  when  you  are  a  distance  wbj  y  v"^  —  w"^  from  point  C. 


Summary — Steps  to  solve  an  optimization  problem. 

1.  Decide  what  the  variables  are  and  what  the  constants  are,  draw  a  diagram  if 
appropriate,  understand  clearly  what  it  is  that  is  to  be  maximized  or  minimized. 

2.  Write  a  formula  for  the  function  for  which  you  wish  to  find  the  maximum  or 
minimum. 

3.  Express  that  formula  in  terms  of  only  one  variable,  that  is,  in  the  form  f{x). 

4.  Set  /'(x)  =  0  and  solve.  Check  all  critical  values  and  endpoints  to  determine  the 
extreme  value. 

Exercises  6.1. 


Find  the  maximum  value  and  minimum  values  of  f{x)  for  x  in  [0,4].  Graph  f{x)  to  check 
your  answers. 

2.  Find  the  dimensions  of  the  rectangle  of  largest  area  having  fixed  perimeter  100.  =^ 

3.  Find  the  dimensions  of  the  rectangle  of  largest  area  having  fixed  perimeter  P.  =^ 

4.  A  box  with  square  base  and  no  top  is  to  hold  a  volume  100.  Find  the  dimensions  of  the  box 
that  requires  the  least  material  for  the  five  sides.  Also  find  the  ratio  of  height  to  side  of  the 
base.  =^> 

5.  A  box  with  square  base  is  to  hold  a  volume  200.  The  bottom  and  top  are  formed  by  folding 
in  flaps  from  all  four  sides,  so  that  the  bottom  and  top  consist  of  two  layers  of  cardboard. 
Find  the  dimensions  of  the  box  that  requires  the  least  material.  Also  find  the  ratio  of  height 
to  side  of  the  base.  =^> 

6.  A  box  with  square  base  and  no  top  is  to  hold  a  volume  V .  Find  (in  terms  of  V)  the  dimensions 
of  the  box  that  requires  the  least  material  for  the  five  sides.  Also  find  the  ratio  of  height  to 
side  of  the  base.  (This  ratio  will  not  involve  V .) 

7.  You  have  100  feet  of  fence  to  make  a  rectangular  play  area  alongside  the  wall  of  your  house. 
The  wall  of  the  house  bounds  one  side.  What  is  the  largest  size  possible  (in  square  feet)  for 
the  play  area?  =^ 


If  you  start  closer  than  this  to  C,  you  should  cut  directly  across  the  sand. 


□ 


for  X  <  3 
for  a;  >  3 
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8.  You  have  I  feet  of  fence  to  make  a  rectangular  play  area  alongside  the  wall  of  your  house. 
The  wall  of  the  house  bounds  one  side.  What  is  the  largest  size  possible  (in  square  feet)  for 
the  play  area?  =^ 

9.  Marketing  tells  you  that  if  you  set  the  price  of  an  item  at  $10  then  you  will  be  unable  to  sell 
it,  but  that  you  can  sell  500  items  for  each  dollar  below  $10  that  you  set  the  price.  Suppose 
your  fixed  costs  total  $3000,  and  your  marginal  cost  is  $2  per  item.  What  is  the  most  profit 
you  can  make?  =^ 

10.  Find  the  area  of  the  largest  rectangle  that  fits  inside  a  semicircle  of  radius  10  (one  side  of 
the  rectangle  is  along  the  diameter  of  the  semicircle).  =^> 

11.  Find  the  area  of  the  largest  rectangle  that  fits  inside  a  semicircle  of  radius  r  (one  side  of  the 
rectangle  is  along  the  diameter  of  the  semicircle). 

12.  For  a  cylinder  with  surface  area  50,  including  the  top  and  the  bottom,  find  the  ratio  of  height 
to  base  radius  that  maximizes  the  volume.  =^ 

13.  For  a  cylinder  with  given  surface  area  S,  including  the  top  and  the  bottom,  find  the  ratio  of 
height  to  base  radius  that  maximizes  the  volume.  =^ 

14.  You  want  to  make  cylindrical  containers  to  hold  1  liter  using  the  least  amount  of  construction 
material.  The  side  is  made  from  a  rectangular  piece  of  material,  and  this  can  be  done  with 
no  material  wasted.  However,  the  top  and  bottom  are  cut  from  squares  of  side  2r,  so  that 
2(2r)^  =  8r'^  of  material  is  needed  (rather  than  2iir'^ ,  which  is  the  total  area  of  the  top  and 
bottom).  Find  the  dimensions  of  the  container  using  the  least  amount  of  material,  and  also 
find  the  ratio  of  height  to  radius  for  this  container.  =^ 

15.  You  want  to  make  cylindrical  containers  of  a  given  volume  V  using  the  least  amount  of 
construction  material.  The  side  is  made  from  a  rectangular  piece  of  material,  and  this  can 
be  done  with  no  material  wasted.  However,  the  top  and  bottom  are  cut  from  squares  of  side 
2r,  so  that  2(2r)^  =  Sr^  of  material  is  needed  (rather  than  27rr^,  which  is  the  total  area  of 
the  top  and  bottom).  Find  the  optimal  ratio  of  height  to  radius. 

16.  Given  a  right  circular  cone,  you  put  an  upside-down  cone  inside  it  so  that  its  vertex  is  at  the 
center  of  the  base  of  the  larger  cone  and  its  base  is  parallel  to  the  base  of  the  larger  cone.  If 
you  choose  the  upside-down  cone  to  have  the  largest  possible  volume,  what  fraction  of  the 
volume  of  the  larger  cone  does  it  occupy?  (Let  H  and  R  be  the  height  and  base  radius  of 
the  larger  cone,  and  let  h  and  r  be  the  height  and  base  radius  of  the  smaller  cone.  Hint:  Use 
similar  triangles  to  get  an  equation  relating  h  and  r.) 

17.  In  example  6.1.12,  what  happens  \i  w  >v  (i.e.,  your  speed  on  sand  is  at  least  your  speed  on 
the  road)?  =^ 

18.  A  container  holding  a  fixed  volume  is  being  made  in  the  shape  of  a  cylinder  with  a  hemi- 
spherical top.  (The  hemispherical  top  has  the  same  radius  as  the  cylinder.)  Find  the  ratio  of 
height  to  radius  of  the  cylinder  which  minimizes  the  cost  of  the  container  if  (a)  the  cost  per 
unit  area  of  the  top  is  twice  as  great  as  the  cost  per  unit  area  of  the  side,  and  the  container 
is  made  with  no  bottom;  (b)  the  same  as  in  (a),  except  that  the  container  is  made  with  a 
circular  bottom,  for  which  the  cost  per  unit  area  is  1.5  times  the  cost  per  unit  area  of  the 
side.  =^ 

19.  A  piece  of  cardboard  is  1  meter  by  1/2  meter.  A  square  is  to  be  cut  from  each  corner  and 
the  sides  folded  up  to  make  an  open-top  box.  What  are  the  dimensions  of  the  box  with 
maximum  possible  volume?  =^ 
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20.  (a)  A  square  piece  of  cardboard  of  side  a  is  used  to  make  an  open-top  box  by  cutting  out 
a  small  square  from  each  corner  and  bending  up  the  sides.  How  large  a  square  should  be 
cut  from  each  corner  in  order  that  the  box  have  maximum  volume?  (b)  What  if  the  piece  of 
cardboard  used  to  make  the  box  is  a  rectangle  of  sides  a  and  5?  =^> 

21.  A  window  consists  of  a  rectangular  piece  of  clear  glass  with  a  semicircular  piece  of  colored 
glass  on  top;  the  colored  glass  transmits  only  1/2  as  much  light  per  unit  area  as  the  the  clear 
glass.  If  the  distance  from  top  to  bottom  (across  both  the  rectangle  and  the  semicircle)  is 
2  meters  and  the  window  may  be  no  more  than  1.5  meters  wide,  find  the  dimensions  of  the 
rectangular  portion  of  the  window  that  lets  through  the  most  light.  =^> 

22.  A  window  consists  of  a  rectangular  piece  of  clear  glass  with  a  semicircular  piece  of  colored 
glass  on  top.  Suppose  that  the  colored  glass  transmits  only  k  times  as  much  light  per  unit 
area  as  the  clear  glass  {k  is  between  0  and  1).  If  the  distance  from  top  to  bottom  (across 
both  the  rectangle  and  the  semicircle)  is  a  fixed  distance  H,  find  (in  terms  of  k)  the  ratio  of 
vertical  side  to  horizontal  side  of  the  rectangle  for  which  the  window  lets  through  the  most 
light.  =^ 

23.  You  are  designing  a  poster  to  contain  a  fixed  amount  A  of  printing  (measured  in  square 
centimeters)  and  have  margins  of  a  centimeters  at  the  top  and  bottom  and  b  centimeters  at 
the  sides.  Find  the  ratio  of  vertical  dimension  to  horizontal  dimension  of  the  printed  area  on 
the  poster  if  you  want  to  minimize  the  amount  of  posterboard  needed.  =^ 

24.  The  strength  of  a  rectangular  beam  is  proportional  to  the  product  of  its  width  w  times  the 
square  of  its  depth  d.  Find  the  dimensions  of  the  strongest  beam  that  can  be  cut  from  a 
cylindrical  log  of  radius  r.  =^> 


25.  What  fraction  of  the  volume  of  a  sphere  is  taken  up  by  the  largest  cylinder  that  can  be  fit 
inside  the  sphere?  =^> 

26.  The  U.S.  post  office  will  accept  a  box  for  shipment  only  if  the  sum  of  the  length  and  girth 
(distance  around)  is  at  most  108  in.  Find  the  dimensions  of  the  largest  acceptable  box  with 
square  front  and  back. 

27.  Find  the  dimensions  of  the  lightest  cylindrical  can  containing  0.25  liter  (=250  cm^)  if  the 
top  and  bottom  are  made  of  a  material  that  is  twice  as  heavy  (per  unit  area)  as  the  material 
used  for  the  side. 

28.  A  conical  paper  cup  is  to  hold  1/4  of  a  liter.  Find  the  height  and  radius  of  the  cone  which 
minimizes  the  amount  of  paper  needed  to  make  the  cup.  Use  the  formula  Tvr\/r^~+~h?  for 
the  area  of  the  side  of  a  cone.  =^ 

29.  A  conical  paper  cup  is  to  hold  a  fixed  volume  of  water.  Find  the  ratio  of  height  to  base  radius 
of  the  cone  which  minimizes  the  amount  of  paper  needed  to  make  the  cup.  Use  the  formula 
j^j,^y^i^2~^rj^  for  the  area  of  the  side  of  a  cone,  called  the  lateral  area  of  the  cone. 


Figure  6.1.6     Cutting  a  beam. 
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30.  If  you  fit  tfie  cone  with  the  largest  possible  surface  area  (lateral  area  plus  area  of  base)  into 
a  sphere,  what  percent  of  the  volume  of  the  sphere  is  occupied  by  the  cone?  =^> 

31.  Two  electrical  charges,  one  a  positive  charge  A  of  magnitude  a  and  the  other  a  negative 
charge  B  of  magnitude  b,  are  located  a  distance  c  apart.  A  positively  charged  particle  P  is 
situated  on  the  line  between  A  and  B.  Find  where  P  should  be  put  so  that  the  pull  away 
from  A  towards  B  is  minimal.  Here  assume  that  the  force  from  each  charge  is  proportional 
to  the  strength  of  the  source  and  inversely  proportional  to  the  square  of  the  distance  from 
the  source.  =^> 

32.  Find  the  fraction  of  the  area  of  a  triangle  that  is  occupied  by  the  largest  rectangle  that  can 
be  drawn  in  the  triangle  (with  one  of  its  sides  along  a  side  of  the  triangle).  Show  that  this 
fraction  does  not  depend  on  the  dimensions  of  the  given  triangle.  =^ 

33.  How  are  your  answers  to  Problem  9  affected  if  the  cost  per  item  for  the  x  items,  instead 
of  being  simply  $2,  decreases  below  $2  in  proportion  to  x  (because  of  economy  of  scale  and 
volume  discounts)  by  1  cent  for  each  25  items  produced? 

34.  You  are  standing  near  the  side  of  a  large  wading  pool  of  uniform  depth  when  you  see  a  child 
in  trouble.  You  can  run  at  a  speed  vi  on  land  and  at  a  slower  speed  V2  in  the  water.  Your 
perpendicular  distance  from  the  side  of  the  pool  is  a,  the  child's  perpendicular  distance  is  b, 
and  the  distance  along  the  side  of  the  pool  between  the  closest  point  to  you  and  the  closest 
point  to  the  child  is  c  (see  the  figure  below).  Without  stopping  to  do  any  calculus,  you 
instinctively  choose  the  quickest  route  (shown  in  the  figure)  and  save  the  child.  Our  purpose 
is  to  derive  a  relation  between  the  angle  9\  your  path  makes  with  the  perpendicular  to  the  side 
of  the  pool  when  you're  on  land,  and  the  angle  02  your  path  makes  with  the  perpendicular 
when  you're  in  the  water.  To  do  this,  let  x  be  the  distance  between  the  closest  point  to  you 
at  the  side  of  the  pool  and  the  point  where  you  enter  the  water.  Write  the  total  time  you 
run  (on  land  and  in  the  water)  in  terms  of  x  (and  also  the  constants  a,  b,  c,  vi,V2)-  Then  set 
the  derivative  equal  to  zero.  The  result,  called  "Snell's  law"  or  the  "law  of  refraction,"  also 
governs  the  bending  of  light  when  it  goes  into  water.  =^> 


x 

yc  —  x 

a 

Figure  6.1.7     Wading  pool  rescue. 

6.2    Related  Rates 

Suppose  we  have  two  variables  x  and  y  (in  most  problems  the  letters  will  be  different, 
but  for  now  let's  use  x  and  y)  which  are  both  changing  with  time.  A  "related  rates" 
problem  is  a  problem  in  which  we  know  one  of  the  rates  of  change  at  a  given  instant — say. 
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X  =  dx/dt — and  we  want  to  find  the  other  rate  y  =  dy/dt  at  that  instant.  (The  use  of 
X  to  mean  dx/dt  goes  back  to  Newton  and  is  still  used  for  this  purpose,  especially  by 
physicists.) 

If  y  is  written  in  terms  of  x,  i.e.,  y  =  /(x),  then  this  is  easy  to  do  using  the  chain  rule: 

dy      dy    dx  dy 

y  =  -r  —  ~r  '  ~r  =  ~r^- 
dt      dx    dt  dx 

That  is,  find  the  derivative  of  /(x),  plug  in  the  value  of  x  at  the  instant  in  question,  and 
multiply  by  the  given  value  of  x  =  dx/dt  to  get  y  =  dy/dt. 

EXAMPLE  6.2.1  Suppose  an  object  is  moving  along  a  path  described  hy  y  —  x^,  that 
is,  it  is  moving  on  a  parabolic  path.  At  a  particular  time,  say  t  =  5,  the  x  coordinate  is 
6  and  we  measure  the  speed  at  which  the  x  coordinate  of  the  object  is  changing  and  find 
that  dx/dt  =  3.  At  the  same  time,  how  fast  is  the  y  coordinate  changing? 

Using  the  chain  rule,  dy/dt  =  2x  ■  dx/dt.  At  t  =  5  we  know  that  x  =  6  and  dx/dt  =  3, 
so  dy/dt  =  2  ■  6  ■  3  =  36.  □ 

In  many  cases,  particularly  interesting  ones,  x  and  y  will  be  related  in  some  other  way, 
for  example  x  —  f{y),  or  F{x,y)  =  /c,  or  perhaps  F{x,y)  =  G{x,y),  where  F{x,y)  and 
G{x,y)  are  expressions  involving  both  variables.  In  all  cases,  you  can  solve  the  related 
rates  problem  by  taking  the  derivative  of  both  sides,  plugging  in  all  the  known  values 
(namely,  x,  y,  and  x),  and  then  solving  for  y. 

To  summarize,  here  are  the  steps  in  doing  a  related  rates  problem: 

1.  Decide  what  the  two  variables  are. 

2.  Find  an  equation  relating  them. 

3.  Take  d/dt  of  both  sides. 

4.  Plug  in  all  known  values  at  the  instant  in  question. 

5.  Solve  for  the  unknown  rate. 

EXAMPLE  6.2.2  A  plane  is  flying  directly  away  from  you  at  500  mph  at  an  altitude 
of  3  miles.  How  fast  is  the  plane's  distance  from  you  increasing  at  the  moment  when  the 
plane  is  flying  over  a  point  on  the  ground  4  miles  from  you? 

To  see  what's  going  on,  we  first  draw  a  schematic  representation  of  the  situation,  as 
in  figure  6.2.1. 

Because  the  plane  is  in  level  fiight  directly  away  from  you,  the  rate  at  which  x  changes 
is  the  speed  of  the  plane,  dx/dt  =  500.  The  distance  between  you  and  the  plane  is  y;  it 
is  dy/dt  that  we  wish  to  know.  By  the  Pythagorean  Theorem  we  know  that  x'^  +  9  =  y'^. 
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y\  ^ 

y 

3 

y 

Figure  6.2.1     Receding  airplane. 

Taking  the  derivative: 

We  are  interested  in  the  time  at  which  x  =  4;  at  this  time  we  know  that  4^  +  9  =  y^,  so 
y  =  5.  Putting  together  aU  the  information  we  get 

2(4)(500)=  2(5)y. 

Thus,  y  =  400  mph.  □ 

EXAMPLE  6.2.3  You  are  inflating  a  spherical  balloon  at  the  rate  of  7  cm'^/sec.  How 
fast  is  its  radius  increasing  when  the  radius  is  4  cm? 

Here  the  variables  are  the  radius  r  and  the  volume  V.  We  know  dV/dt,  and  we  want 
dr/dt.  The  two  variables  are  related  by  means  of  the  equation  V  =  Aivr^/S.  Taking  the 
derivative  of  both  sides  gives  dV/dt  =  Aivr'^f.  We  now  substitute  the  values  we  know  at 
the  instant  in  question:  7  =  AnA^r,  so  r  =  7/(647r)  cm/sec.  □ 

EXAMPLE  6.2.4  Water  is  poured  into  a  conical  container  at  the  rate  of  10  cm^/sec. 
The  cone  points  directly  down,  and  it  has  a  height  of  30  cm  and  a  base  radius  of  10  cm;  see 
figure  6.2.2.  How  fast  is  the  water  level  rising  when  the  water  is  4  cm  deep  (at  its  deepest 
point)? 

The  water  forms  a  conical  shape  within  the  big  cone;  its  height  and  base  radius  and 
volume  are  all  increasing  as  water  is  poured  into  the  container.  This  means  that  we  actually 
have  three  things  varying  with  time:  the  water  level  h  (the  height  of  the  cone  of  water), 
the  radius  r  of  the  circular  top  surface  of  water  (the  base  radius  of  the  cone  of  water),  and 
the  volume  of  water  V .  The  volume  of  a  cone  is  given  by  V  =  Trr^/i/S.  We  know  dV/dt, 
and  we  want  dh/dt.  At  first  something  seems  to  be  wrong:  we  have  a  third  variable  r 
whose  rate  we  don't  know. 

But  the  dimensions  of  the  cone  of  water  must  have  the  same  proportions  as  those  of 
the  container.  That  is,  because  of  similar  triangles,  r/h  =  10/30  so  r  =  /i/3.  Now  we  can 
eliminate  r  from  the  problem  entirely:  V  =  7r(/i/3)^/i/3  =  ixh^" /21.  We  take  the  derivative 
of  both  sides  and  plug  in  /i  =  4  and  dV/dt  =  10,  obtaining  10  =  {3n  -4^ / 27) (dh/dt).  Thus, 
dh/dt  =  90/(167r)  cm/sec.  □ 
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Figure  6.2.2     Conical  water  tank. 

EXAMPLE  6.2.5  A  swing  consists  of  a  board  at  the  end  of  a  10  ft  long  rope.  Think 
of  the  board  as  a  point  P  at  the  end  of  the  rope,  and  let  Q  be  the  point  of  attachment 
at  the  other  end.  Suppose  that  the  swing  is  directly  below  Q  at  time  t  =  0,  and  is  being 
pushed  by  someone  who  walks  at  6  ft/sec  from  left  to  right.  Find  (a)  how  fast  the  swing 
is  rising  after  1  sec;  (b)  the  angular  speed  of  the  rope  in  deg/sec  after  1  sec. 

We  start  out  by  asking:  What  is  the  geometric  quantity  whose  rate  of  change  we  know, 
and  what  is  the  geometric  quantity  whose  rate  of  change  we're  being  asked  about?  Note 
that  the  person  pushing  the  swing  is  moving  horizontally  at  a  rate  we  know.  In  other 
words,  the  horizontal  coordinate  of  P  is  increasing  at  6  ft/sec.  In  the  xy- plane  let  us 
make  the  convenient  choice  of  putting  the  origin  at  the  location  of  P  at  time  t  =  0,  i.e.,  a 
distance  10  directly  below  the  point  of  attachment.  Then  the  rate  we  know  is  dx/dt,  and 
in  part  (a)  the  rate  we  want  is  dy/dt  (the  rate  at  which  P  is  rising).  In  part  (b)  the  rate 
we  want  is  ^  =  d9/dt,  where  9  stands  for  the  angle  in  radians  through  which  the  swing  has 
swung  from  the  vertical.  (Actually,  since  we  want  our  answer  in  deg/sec,  at  the  end  we 
must  convert  dO/dt  from  rad/sec  by  multiplying  by  ISO/tt.) 

(a)  From  the  diagram  we  see  that  we  have  a  right  triangle  whose  legs  are  x  and  10  —  y, 
and  whose  hypotenuse  is  10.  Hence  +  (10  —  y)^  =  100.  Taking  the  derivative  of  both 
sides  we  obtain:  2xx  +  2(10  —  y){0  —  y)  =  0.  We  now  look  at  what  we  know  after  1  second, 
namely  x  =  6  (because  x  started  at  0  and  has  been  increasing  at  the  rate  of  6  ft/sec 
for  1  sec),  y  —  2  (because  we  get  10  —  y  =  8  from  the  Pythagorean  theorem  applied  to 
the  triangle  with  hypotenuse  10  and  leg  6),  and  x  =  6.  Putting  in  these  values  gives  us 
2  •  6  •  6  —  2  •  8y  =  0,  from  which  we  can  easily  solve  for  y:  y  =  4.5  ft/sec. 

(b)  Here  our  two  variables  are  x  and  6,  so  we  want  to  use  the  same  right  triangle  as 
in  part  (a),  but  this  time  relate  9  to  x.  Since  the  hypotenuse  is  constant  (equal  to  10), 
the  best  way  to  do  this  is  to  use  the  sine:  sin^  =  a;/10.  Taking  derivatives  we  obtain 
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Figure  6.2.3  Swing. 

(cos^)^  =  O.lx.  At  the  instant  in  question  {t  =  1  sec),  when  we  have  a  right  triangle  with 
sides  6-8-10,  cos^  =  8/10  and  a;  =  6.  Thus  (8/10)^  =  6/10,  i.e.,  ^  =  6/8  =  3/4  rad/sec, 
or  approximately  43  deg/sec.  n 

We  have  seen  that  sometimes  there  are  apparently  more  than  two  variables  that  change 
with  time,  but  in  reality  there  are  just  two,  as  the  others  can  be  expressed  in  terms  of 
just  two.  But  sometimes  there  really  are  several  variables  that  change  with  time;  as  long 
as  you  know  the  rates  of  change  of  all  but  one  of  them  you  can  find  the  rate  of  change  of 
the  remaining  one.  As  in  the  case  when  there  are  just  two  variables,  take  the  derivative 
of  both  sides  of  the  equation  relating  all  of  the  variables,  and  then  substitute  all  of  the 
known  values  and  solve  for  the  unknown  rate. 


EXAMPLE  6.2.6  A  road  running  north  to  south  crosses  a  road  going  east  to  west  at 
the  point  P.  Car  A  is  driving  north  along  the  first  road,  and  car  B  is  driving  east  along  the 
second  road.  At  a  particular  time  car  A  is  10  kilometers  to  the  north  of  P  and  traveling  at 
80  km/hr,  while  car  B  is  15  kilometers  to  the  east  of  P  and  traveling  at  100  km/hr.  How 
fast  is  the  distance  between  the  two  cars  changing? 


(0,a(t)) 


c{t) 


 ^"-^ 

(6(t),0) 


Figure  6.2.4     Cars  moving  apart. 
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Let  a{t)  be  the  distance  of  car  A  north  of  P  at  time  t,  and  b{t)  the  distance  of  car  B  east 
of  P  at  time  t,  and  let  c(t)  be  the  distance  from  car  A  to  car  B  at  time  t.  By  the  Pythagorean 
Theorem,  c{tf  =  a{tf  +  b{tf.  Taking  derivatives  we  get  2c{t)c{t)  =  2a{t)a' {t)  +  2b{t)b' (t), 
so 

ad  +  bb       aa  +  bb 
^~      c      ~        +  62  ■ 

Substituting  known  values  we  get: 

.     10.80  +  15.100      460  j^^^^^^^^ 


V102  +  152 

at  the  time  of  interest.  □ 

Notice  how  this  problem  differs  from  example  6.2.2.  In  both  cases  we  started  with  the 
Pythagorean  Theorem  and  took  derivatives  on  both  sides.  However,  in  example  6.2.2  one 
of  the  sides  was  a  constant  (the  altitude  of  the  plane),  and  so  the  derivative  of  the  square 
of  that  side  of  the  triangle  was  simply  zero.  In  this  example,  on  the  other  hand,  all  three 
sides  of  the  right  triangle  are  variables,  even  though  we  are  interested  in  a  specific  value 
of  each  side  of  the  triangle  (namely,  when  the  sides  have  lengths  10  and  15).  Make  sure 
that  you  understand  at  the  start  of  the  problem  what  are  the  variables  and  what  are  the 
constants. 


Exercises  6.2. 

1.  A  cylindrical  tank  standing  upright  (with  one  circular  base  on  the  ground)  has  radius  20 
cm.  How  fast  does  the  water  level  in  the  tank  drop  when  the  water  is  being  drained  at  25 
cm^/sec?  =^ 

2.  A  cylindrical  tank  standing  upright  (with  one  circular  base  on  the  ground)  has  radius  1 
meter.  How  fast  does  the  water  level  in  the  tank  drop  when  the  water  is  being  drained  at  3 
liters  per  second?  =^ 

3.  A  ladder  13  meters  long  rests  on  horizontal  ground  and  leans  against  a  vertical  wall.  The 
foot  of  the  ladder  is  pulled  away  from  the  wall  at  the  rate  of  0.6  m/sec.  How  fast  is  the  top 
sliding  down  the  wall  when  the  foot  of  the  ladder  is  5  m  from  the  wall?  =^ 

4.  A  ladder  13  meters  long  rests  on  horizontal  ground  and  leans  against  a  vertical  wall.  The 
top  of  the  ladder  is  being  pulled  up  the  wall  at  0.1  meters  per  second.  How  fast  is  the  foot 
of  the  ladder  approaching  the  wall  when  the  foot  of  the  ladder  is  5  m  from  the  wall?  =^ 

5.  A  rotating  beacon  is  located  2  miles  out  in  the  water.  Let  A  be  the  point  on  the  shore  that 
is  closest  to  the  beacon.  As  the  beacon  rotates  at  10  rev/min,  the  beam  of  light  sweeps  down 
the  shore  once  each  time  it  revolves.  Assume  that  the  shore  is  straight.  How  fast  is  the  point 
where  the  beam  hits  the  shore  moving  at  an  instant  when  the  beam  is  lighting  up  a  point  2 
miles  along  the  shore  from  the  point  A? 

6.  A  baseball  diamond  is  a  square  90  ft  on  a  side.  A  player  runs  from  first  base  to  second  base 
at  15  ft/sec.  At  what  rate  is  the  player's  distance  from  third  base  decreasing  when  she  is 
half  way  from  first  to  second  base?  =^> 
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7.  Sand  is  poured  onto  a  surface  at  15  cm^/sec,  forming  a  conical  pile  whose  base  diameter  is 
always  equal  to  its  altitude.  How  fast  is  the  altitude  of  the  pile  increasing  when  the  pile  is  3 
cm  high?  =^ 

8.  A  boat  is  pulled  in  to  a  dock  by  a  rope  with  one  end  attached  to  the  front  of  the  boat  and 
the  other  end  passing  through  a  ring  attached  to  the  dock  at  a  point  5  ft  higher  than  the 
front  of  the  boat.  The  rope  is  being  pulled  through  the  ring  at  the  rate  of  0.6  ft/sec.  How 
fast  is  the  boat  approaching  the  dock  when  13  ft  of  rope  are  out?  =^ 

9.  A  balloon  is  at  a  height  of  50  meters,  and  is  rising  at  the  constant  rate  of  5  m/sec.  A  bicyclist 
passes  beneath  it,  traveling  in  a  straight  line  at  the  constant  speed  of  10  m/sec.  How  fast  is 
the  distance  between  the  bicyclist  and  the  balloon  increasing  2  seconds  later? 

10.  A  pyramid-shaped  vat  has  square  cross-section  and  stands  on  its  tip.  The  dimensions  at  the 
top  are  2  m  x  2  m,  and  the  depth  is  5  m.  If  water  is  flowing  into  the  vat  at  3  m^/min,  how 
fast  is  the  water  level  rising  when  the  depth  of  water  (at  the  deepest  point)  is  4  m?  Note: 
the  volume  of  any  "conical"  shape  (including  pyramids)  is  (1/3) (height) (area  of  base).  =^> 

11.  The  sun  is  rising  at  the  rate  of  1/4  deg/min,  and  appears  to  be  climbing  into  the  sky 
perpendicular  to  the  horizon,  as  depicted  in  figure  6.2.5.  How  fast  is  the  shadow  of  a  200 
meter  building  shrinking  at  the  moment  when  the  shadow  is  500  meters  long? 

12.  The  sun  is  setting  at  the  rate  of  1/4  deg/min,  and  appears  to  be  dropping  perpendicular  to 
the  horizon,  as  depicted  in  figure  6.2.5.  How  fast  is  the  shadow  of  a  25  meter  wall  lengthening 
at  the  moment  when  the  shadow  is  50  meters  long?  =^ 


Figure  6.2.5     Sunrise  or  sunset. 

13.  The  trough  shown  in  figure  6.2.6  is  constructed  by  fastening  together  three  slabs  of  wood  of 
dimensions  10  ft  x  1  ft,  and  then  attaching  the  construction  to  a  wooden  wall  at  each  end. 
The  angle  9  was  originally  30°,  but  because  of  poor  construction  the  sides  are  collapsing. 
The  trough  is  full  of  water.  At  what  rate  (in  ft'^/sec)  is  the  water  spilling  out  over  the  top 
of  the  trough  if  the  sides  have  each  fallen  to  an  angle  of  45°,  and  are  collapsing  at  the  rate 
of  1°  per  second? 


1 


Figure  6.2.6  Trough. 
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14.  A  woman  5  ft  tall  walks  at  the  rate  of  3.5  ft/sec  away  from  a  streetlight  that  is  12  ft  above 
the  ground.  At  what  rate  is  the  tip  of  her  shadow  moving?  At  what  rate  is  her  shadow 
lengthening?  =^ 

15.  A  man  1.8  meters  tall  walks  at  the  rate  of  1  meter  per  second  toward  a  streetlight  that  is  4 
meters  above  the  ground.  At  what  rate  is  the  tip  of  his  shadow  moving?  At  what  rate  is  his 
shadow  shortening? 

16.  A  police  helicopter  is  flying  at  150  mph  at  a  constant  altitude  of  0.5  mile  above  a  straight 
road.  The  pilot  uses  radar  to  determine  that  an  oncoming  car  is  at  a  distance  of  exactly  1 
mile  from  the  helicopter,  and  that  this  distance  is  decreasing  at  190  mph.  Find  the  speed  of 
the  car. 

17.  A  police  helicopter  is  flying  at  200  kilometers  per  hour  at  a  constant  altitude  of  1  km  above 
a  straight  road.  The  pilot  uses  radar  to  determine  that  an  oncoming  car  is  at  a  distance 
of  exactly  2  kilometers  from  the  helicopter,  and  that  this  distance  is  decreasing  at  250  kph. 
Find  the  speed  of  the  car.  =^ 

18.  A  light  shines  from  the  top  of  a  pole  20  m  high.  A  ball  is  falling  10  meters  from  the  pole, 
casting  a  shadow  on  a  building  30  meters  away,  as  shown  in  figure  6.2.7.  When  the  ball  is  25 
meters  from  the  ground  it  is  falling  at  6  meters  per  second.  How  fast  is  its  shadow  moving? 


Figure  6.2.7     Falling  ball. 


19.  Do  example  6.2.6  assuming  that  the  angle  between  the  two  roads  is  120°  instead  of  90°  (that 
is,  the  "north-south"  road  actually  goes  in  a  somewhat  northwesterly  direction  from  P). 
Recall  the  law  of  cosines:      =      +     —  2abcos9.  =^ 

20.  Do  example  6.2.6  assuming  that  car  A  is  300  meters  north  of  P,  car  B  is  400  meters  east 
of  P,  both  cars  are  going  at  constant  speed  toward  P,  and  the  two  cars  will  collide  in  10 
seconds.  =^> 

21.  Do  example  6.2.6  assuming  that  8  seconds  ago  car  A  started  from  rest  at  P  and  has  been 
picking  up  speed  at  the  steady  rate  of  5  m/sec^,  and  6  seconds  after  car  A  started  car  B 
passed  P  moving  east  at  constant  speed  60  m/sec. 

22.  Referring  again  to  example  6.2.6,  suppose  that  instead  of  car  B  an  airplane  is  flying  at  speed 
200  km/hr  to  the  east  of  P  at  an  altitude  of  2  km,  as  depicted  in  figure  6.2.8.  How  fast  is 
the  distance  between  car  and  airplane  changing?  =^ 

23.  Referring  again  to  example  6.2.6,  suppose  that  instead  of  car  B  an  airplane  is  flying  at  speed 
200  km/hr  to  the  east  of  P  at  an  altitude  of  2  km,  and  that  it  is  gaining  altitude  at  10  km/hr. 
How  fast  is  the  distance  between  car  and  airplane  changing? 

24.  A  light  shines  from  the  top  of  a  pole  20  m  high.  An  object  is  dropped  from  the  same  height 
from  a  point  10  m  away,  so  that  its  height  at  time  t  seconds  is  h{t)  =  20  —  9.8t^ /2.  How  fast 
is  the  object's  shadow  moving  on  the  ground  one  second  later? 
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Figure  6.2.8     Car  and  airplane. 


25.  The  two  blades  of  a  pair  of  scissors  are  fastened  at  tiie  point  A  as  shown  in  figure  6.2.9.  Let 
a  denote  the  distance  from  A  to  the  tip  of  the  blade  (the  point  B).  Let  /3  denote  the  angle 
at  the  tip  of  the  blade  that  is  formed  by  the  line  AB  and  the  bottom  edge  of  the  blade,  line 
BC,  and  let  9  denote  the  angle  between  AB  and  the  horizontal.  Suppose  that  a  piece  of 
paper  is  cut  in  such  a  way  that  the  center  of  the  scissors  at  A  is  fixed,  and  the  paper  is  also 
fixed.  As  the  blades  are  closed  (i.e.,  the  angle  9  in  the  diagram  is  decreased),  the  distance  x 
between  A  and  C  increases,  cutting  the  paper. 

a.  Express  x  in  terms  of  o,  9,  and  /3. 

b.  Express  dx/dt  in  terms  of  a,  9,  P,  and  d9/dt. 

c.  Suppose  that  the  distance  a  is  20  cm,  and  the  angle  /3  is  5°.  Further  suppose  that  9 
is  decreasing  at  50  deg/sec.  At  the  instant  when  9  =  30°,  find  the  rate  (in  cm/sec)  at 
which  the  paper  is  being  cut.  =^ 


Figure  6.2.9  Scissors. 


6.3    Newton's  Method 

Suppose  you  have  a  function  /(x),  and  you  want  to  find  as  accurately  as  possible  where 
it  crosses  the  x-axis;  in  other  words,  you  want  to  solve  f{x)  =  0.  Suppose  you  know  of 
no  way  to  find  an  exact  solution  by  any  algebraic  procedure,  but  you  are  able  to  use  an 
approximation,  provided  it  can  be  made  quite  close  to  the  true  value.  Newton's  method  is 
a  way  to  find  a  solution  to  the  equation  to  as  many  decimal  places  as  you  want.  It  is  what 
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is  called  an  "iterative  procedure,"  meaning  that  it  can  be  repeated  again  and  again  to 
get  an  answer  of  greater  and  greater  accuracy.  Iterative  procedures  like  Newton's  method 
are  well  suited  to  programming  for  a  computer.  Newton's  method  uses  the  fact  that  the 
tangent  line  to  a  curve  is  a  good  approximation  to  the  curve  near  the  point  of  tangency. 

EXAMPLE  6.3.1    Approximate  Vs.  Since  V?,  is  a  solution  to      =  3  or     —  3  =  0,  we 

use  f{x)  =  —  3.  We  start  by  guessing  something  reasonably  close  to  the  true  value;  this 
is  usually  easy  to  do;  let's  use  -\/3  ~  2.  Now  use  the  tangent  line  to  the  curve  when  x  =  2 
as  an  approximation  to  the  curve,  as  shown  in  figure  6.3.1.  Since  /'(x)  =  2x,  the  slope  of 
this  tangent  line  is  4  and  its  equation  is  y  =  4x  —  7.  The  tangent  line  is  quite  close  to  /(x), 
so  it  crosses  the  x-axis  near  the  point  at  which  /(x)  crosses,  that  is,  near  -\/3.  It  is  easy 
to  find  where  the  tangent  line  crosses  the  x-axis:  solve  0  =  4x  —  7  to  get  x  =  7/4  =  1.75. 
This  is  certainly  a  better  approximation  than  2,  but  let  us  say  not  close  enough.  We  can 
improve  it  by  doing  the  same  thing  again:  find  the  tangent  line  at  x  =  1.75,  find  where 
this  new  tangent  line  crosses  the  x-axis,  and  use  that  value  as  a  better  approximation.  We 
can  continue  this  indefinitely,  though  it  gets  a  bit  tedious.  Lets  see  if  we  can  shortcut  the 
process.  Suppose  the  best  approximation  to  the  intercept  we  have  so  far  is  Xj.  To  find 
a  better  approximation  we  will  always  do  the  same  thing:  find  the  slope  of  the  tangent 
line  at  Xj,  find  the  equation  of  the  tangent  line,  find  the  x-intercept.  The  slope  is  2xi. 
The  tangent  line  is  y  =  (2xi)(x  —  x^)  -|-  (x^  —  3),  using  the  point-slope  formula  for  a  line. 
Finally,  the  intercept  is  found  by  solving  0  =  (2xi)(x  —  Xi)  -f  (x^  —  3).  With  a  Uttle  algebra 
this  turns  into  x  =  {xf  +  3)/(2xi);  this  is  the  next  approximation,  which  we  naturally  call 
Xi_|_i.  Instead  of  doing  the  whole  tangent  line  computation  every  time  we  can  simply  use 
this  formula  to  get  as  many  approximations  as  we  want.  Starting  with  xq  =  2,  we  get 
xi  =  (xq  -|-3)/(2xo)  =  (2^  +  3)/4  =  7/4  (the  same  approximation  we  got  above,  of  course), 
X2  =  (xl  +  3)/(2xi)  =  ((7/4)2  ^  3)/(7/2)  =  97/56  ^  1.73214,  xs  ^  1.73205,  and  so  on. 
This  is  still  a  bit  tedious  by  hand,  but  with  a  calculator  or,  even  better,  a  good  computer 
program,  it  is  quite  easy  to  get  many,  many  approximations.  We  might  guess  already  that 
1.73205  is  accurate  to  two  decimal  places,  and  in  fact  it  turns  out  that  it  is  accurate  to  5 
places.  □ 

Let's  think  about  this  process  in  more  general  terms.  We  want  to  approximate  a 
solution  to  /(x)  =  0.  We  start  with  a  rough  guess,  which  we  call  xq.  We  use  the  tangent 
line  to  /(x)  to  get  a  new  approximation  that  we  hope  will  be  closer  to  the  true  value. 
What  is  the  equation  of  the  tangent  line  when  x  =  xq?  The  slope  is  /'(xq)  and  the  line 
goes  through  (xq,  /(xq)),  so  the  equation  of  the  line  is 


y  =  f'{xo){x  -  Xq)  +  /(xo). 
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Figure  6.3.1     Newton's  method.  (AP) 
Now  we  find  where  this  crosses  the  x-axis  by  substituting  y  =  0  and  solving  for  x: 

xof'ixo)  -  fixo)  f{xo) 

X  —   —  ^   =  Xq 


f'ixo) 


f'ixo) 


We  will  typically  want  to  compute  more  than  one  of  these  improved  approximations,  so 
we  number  them  consecutively;  from  xq  we  have  computed  xi: 

_  xof'{xo)  -  f{xo)  _     _  f{xo) 

f'M  fixoY 

and  in  general  from  Xi  we  compute  Xj+i: 

_  Xjf'jXi)  -  fjXj)  _       _  f{Xi) 


f'{x,) 


f'{x,)' 


EXAMPLE  6.3.2  Returning  to  the  previous  example,  f{x)  =  —  3,  f'{x)  =  2x,  and 
the  formula  becomes  x^+i  =  Xi  —  [xf  —  3)/(2xi)  =  {xi  +  3)/(2xi),  as  before.  □ 

In  practice,  which  is  to  say,  if  you  need  to  approximate  a  value  in  the  course  of 
designing  a  bridge  or  a  building  or  an  airframe,  you  will  need  to  have  some  confidence  that 
the  approximation  you  settle  on  is  accurate  enough.  As  a  rule  of  thumb,  once  a  certain 
number  of  decimal  places  stop  changing  from  one  approximation  to  the  next  it  is  likely 
that  those  decimal  places  are  correct.  Still,  this  may  not  be  enough  assurance,  in  which 
case  we  can  test  the  result  for  accuracy. 

EXAMPLE  6.3.3  Find  the  x  coordinate  of  the  intersection  of  the  curves  y  =  2x  and 
y  =  tan  x,  accurate  to  three  decimal  places.  To  put  this  in  the  context  of  Newton's  method. 
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we  note  that  we  want  to  know  where  2x  =  tanx  or  f{x)  =  tan  a:  —  2x  =  0.  We  compute 
f'{x)  =  sec^  X  —  2  and  set  up  the  formula: 

tan  x^  2X'i 
sec  X ^ 

From  the  graph  in  figure  6.3.2  we  guess  xq  =  1  as  a  starting  point,  then  using  the  formula 
we  compute  xi  =  1.310478030,  X2  =  1.223929096,  X3  =  1.176050900,  x^  =  1.165926508, 
X5  =  1.165561636.  So  we  guess  that  the  first  three  places  are  correct,  but  that  is  not 
the  same  as  saying  1.165  is  correct  to  three  decimal  places — 1.166  might  be  the  correct, 
rounded  approximation.  How  can  we  tell?  We  can  substitute  1.165,  1.1655  and  1.166  into 
tanx  -  2x-  this  gives  -0.002483652,  -0.000271247,  0.001948654.  Since  the  first  two  are 
negative  and  the  third  is  positive,  tanx  —  2x  crosses  the  x  axis  between  1.1655  and  1.166, 
so  the  correct  value  to  three  places  is  1.166.  □ 


Exercises  6.3. 

1.  Approximate  the  fifth  root  of  7,  using  xq  =  1.5  as  a  first  guess.  Use  Newton's  method  to  find 
xy,  as  your  approximation.  =^ 

2.  Use  Newton's  Method  to  approximate  the  cube  root  of  10  to  two  decimal  places.  =^> 

3.  The  function  f{x)  =  x^  —  Sa;^  —  3a;  +  6  has  a  root  between  3  and  4,  because  /(3)  =  —3  and 
/(4)  =  10.  Approximate  the  root  to  two  decimal  places. 

4.  A  rectangular  piece  of  cardboard  of  dimensions  8  x  17  is  used  to  make  an  open-top  box 
by  cutting  out  a  small  square  of  side  x  from  each  corner  and  bending  up  the  sides.  (See 
exercise  20  in  6.1.)  If  a;  =  2,  then  the  volume  of  the  box  is  2  •  4  •  13  =  104.  Use  Newton's 
method  to  find  a  value  of  x  for  which  the  box  has  volume  100,  accurate  to  3  significant 
figures.  =^ 
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6.4    Linear  Approximations 

Newton's  method  is  one  example  of  the  usefulness  of  the  tangent  line  as  an  approximation 
to  a  curve.  Here  we  explore  another  such  application. 

Recall  that  the  tangent  line  to  f{x)  at  a  point  x  =  a  is  given  by  L{x)  =  f'{a){x  —  a)  + 
/(a).  The  tangent  line  in  this  context  is  also  called  the  linear  approximation  to  /  at  a. 

If  /  is  differentiable  at  a  then  L  is  a  good  approximation  of  /  so  long  as  x  is  "not  too 
far"  from  a.  Put  another  way,  if  /  is  differentiable  at  a  then  under  a  microscope  /  will 
look  very  much  like  a  straight  line.  Figure  6.4.1  shows  a  tangent  line  to  y  =  x^  at  three 
different  magnifications. 

If  we  want  to  approximate  f{b),  because  computing  it  exactly  is  difficult,  we  can 
approximate  the  value  using  a  linear  approximation,  provided  that  we  can  compute  the 
tangent  line  at  some  a  close  to  b. 


Figure  6.4.1     The  linear  approximation  to  y  =  x^. 


EXAMPLE  6.4.1  Let  f{x)  =  Vx  +  A.  Then  f'{x)  =  l/(2Vx  +  4).  The  linear  ap- 
proximation to  /  at  X  =  5  is  L(x)  =  1/(2a/5  +  4)(x  -  5)  +  V5  +  4  =  (x  -  5)/6  +  3.  As 
an  immediate  application  we  can  approximate  square  roots  of  numbers  near  9  by  hand. 
To  estimate  y/lO,  we  substitute  6  into  the  linear  approximation  instead  of  into  /(x),  so 
^6  +  4  ^  (6  -  5)/6  +  3  =  19/6  ^  3.16.  This  rounds  to  3.17  while  the  square  root  of  10  is 
actually  3.16  to  two  decimal  places,  so  this  estimate  is  only  accurate  to  one  decimal  place. 
This  is  not  too  surprising,  as  10  is  really  not  very  close  to  9;  on  the  other  hand,  for  many 
calculations,  3.2  would  be  accurate  enough.  □ 

With  modern  calculators  and  computing  software  it  may  not  appear  necessary  to  use 
linear  approximations.  But  in  fact  they  are  quite  useful.  In  cases  requiring  an  explicit 
numerical  approximation,  they  allow  us  to  get  a  quick  rough  estimate  which  can  be  used 
as  a  "reality  check"  on  a  more  complex  calculation.  In  some  complex  calculations  involving 
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functions,  the  linear  approximation  makes  an  otherwise  intractable  calculation  possible, 
without  serious  loss  of  accuracy. 

EXAMPLE  6.4.2  Consider  the  trigonometric  function  sinx.  Its  linear  approximation 
at  X  =  0  is  simply  L{x)  =  x.  When  x  is  small  this  is  quite  a  good  approximation  and  is 
used  frequently  by  engineers  and  scientists  to  simplify  some  calculations.  □ 

DEFINITION  6.4.3  Let  y  =  f{x)  be  a  differentiable  function.  We  define  a  new 
independent  variable  dx^  and  a  new  dependent  variable  dy  =  f'{x)  dx.  Notice  that  dy  is  a 
function  both  of  x  (since  f'{x)  is  a  function  of  x)  and  of  dx.  We  say  that  dx  and  dy  are 
differentials.  □ 

Let  Ax  =  X  —  a  and  Ay  =  f[x)  —  f{a).  If  x  is  near  a  then  Ax  is  small.  If  we  set 
dx  —  Ax  then 

dy  =  f  (a)  dx  ~       Ax  =  Ay. 

Thus,  dy  can  be  used  to  approximate  Ay,  the  actual  change  in  the  function  /  between  a 
and  X.  This  is  exactly  the  approximation  given  by  the  tangent  line: 

dy  =  /'(a)(x  -  a)  =  /'(a)(x  -  a)  +  f{a)  -  f{a)  =  L(x)  -  /(a). 

While  L{x)  approximates  /(x),  dy  approximates  how  /(x)  has  changed  from  /(a).  Fig- 
ure 6.4.2  illustrates  the  relationships. 


"T 

X 


Figure  6.4.2  Differentials. 
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Exercises  6.4- 

1.  Let  f{x)  =  x'^.  If  a  =  1  and  dx  =  Ax  =  1/2,  what  are  Ay  and  dy? 

2.  Let  f{x)  =  \fx.  If  a  =  1  and  dx  =  Ax  =  1/10,  what  are  Ay  and  dy?  =^ 

3.  Let  f{x)  =  sin(2a:).  If  a  =  tt  and  dx  =  Ax  =  tt/IOO,  what  are  Ay  and  dy? 

4.  Use  differentials  to  estimate  the  amount  of  paint  needed  to  apply  a  coat  of  paint  0.02  cm 
thick  to  a  sphere  with  diameter  40  meters.  (Recall  that  the  volume  of  a  sphere  of  radius  r  is 
V  =  (4/3)7rr^.  Notice  that  you  are  given  that  dr  =  0.02.)  =^> 

5.  Show  in  detail  that  the  linear  approximation  of  sin  a;  at  a;  —  0  is  L(^x)  —  x  and  the  linear 
approximation  of  cosx  at  x  =  0  is  L{x)  =  1. 

6.5    The  Mean  Value  Theorem 

Here  are  two  interesting  questions  involving  derivatives: 

1.  Suppose  two  different  functions  have  the  same  derivative;  what  can  you  say  about 
the  relationship  between  the  two  functions? 

2.  Suppose  you  drive  a  car  from  toll  booth  on  a  toll  road  to  another  toll  booth  at 
an  average  speed  of  70  miles  per  hour.  What  can  be  concluded  about  your  actual 
speed  during  the  trip?  In  particular,  did  you  exceed  the  65  mile  per  hour  speed 
limit? 

While  these  sound  very  different,  it  turns  out  that  the  two  problems  are  very  closely 
related.  We  know  that  "speed"  is  really  the  derivative  by  a  different  name;  let's  start  by 
translating  the  second  question  into  something  that  may  be  easier  to  visualize.  Suppose 
that  the  function  f{t)  gives  the  position  of  your  car  on  the  toll  road  at  time  t.  Your  change 
in  position  between  one  toll  booth  and  the  next  is  given  by  /(ti)  —  f{io),  assuming  that 
at  time  to  you  were  at  the  first  booth  and  at  time  ti  you  arrived  at  the  second  booth. 
Your  average  speed  for  the  trip  is  (/(ti)  —  /(to))/(^i  —  ^o)-  If  we  think  about  the  graph 
of  /(t),  the  average  speed  is  the  slope  of  the  line  that  connects  the  two  points  (to,  /(^o)) 
and  (ti,/(ti)).  Your  speed  at  any  particular  time  t  between  to  and  ti  is  /'(t),  the  slope 
of  the  curve.  Now  question  (2)  becomes  a  question  about  slope.  In  particular,  if  the  slope 
between  endpoints  is  70,  what  can  be  said  of  the  slopes  at  points  between  the  endpoints? 

As  a  general  rule,  when  faced  with  a  new  problem  it  is  often  a  good  idea  to  examine 
one  or  more  simplified  versions  of  the  problem,  in  the  hope  that  this  will  lead  to  an 
understanding  of  the  original  problem.  In  this  case,  the  problem  in  its  "slope"  form  is 
somewhat  easier  to  simplify  than  the  original,  but  equivalent,  problem. 

Here  is  a  special  instance  of  the  problem.  Suppose  that  /(to)  =  /(ti).  Then  the 
two  endpoints  have  the  same  height  and  the  slope  of  the  line  connecting  the  endpoints 
is  zero.  What  can  we  say  about  the  slope  between  the  endpoints?  It  shouldn't  take 
much  experimentation  before  you  are  convinced  of  the  truth  of  this  statement:  Somewhere 
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between  to  and  ti  the  slope  is  exactly  zero,  that  is,  somewhere  between  to  and  ti  the  slope 
is  equal  to  the  slope  of  the  line  between  the  endpoints.  This  suggests  that  perhaps  the  same 
is  true  even  if  the  endpoints  are  at  different  heights,  and  again  a  bit  of  experimentation  will 
probably  convince  you  that  this  is  so.  But  we  can  do  better  than  "experimentation" — we 
can  prove  that  this  is  so. 

We  start  with  the  simplified  version: 

THEOREM  6.5.1  Rolle's  Theorem  Suppose  that  f{x)  has  a  derivative  on  the 
interval  (a,  6),  is  continuous  on  the  interval  [a,  6],  and  /(a)  =  f{b).  Then  at  some  value 
ce  (a,6),  f  (c)  =  0. 

Proof.  We  know  that  f{x)  has  a  maximum  and  minimum  value  on  [a,b]  (because  it 
is  continuous),  and  we  also  know  that  the  maximum  and  minimum  must  occur  at  an 
endpoint,  at  a  point  at  which  the  derivative  is  zero,  or  at  a  point  where  the  derivative  is 
undefined.  Since  the  derivative  is  never  undefined,  that  possibility  is  removed. 

If  the  maximum  or  minimum  occurs  at  a  point  c,  other  than  an  endpoint,  where 
/'(c)  =  0,  then  we  have  found  the  point  we  seek.  Otherwise,  the  maximum  and  minimum 
both  occur  at  an  endpoint,  and  since  the  endpoints  have  the  same  height,  the  maximum 
and  minimum  are  the  same.  This  means  that  f{x)  =  f{a)  =  f{b)  at  every  x  e  [a,b],  so 
the  function  is  a  horizontal  line,  and  it  has  derivative  zero  everywhere  in  {a,b).  Then  we 
may  choose  any  c  at  all  to  get  /'(c)  =  0.  ■ 

Perhaps  remarkably,  this  special  case  is  all  we  need  to  prove  the  more  general  one  as 
well. 

THEOREM  6.5.2  Mean  Value  Theorem  Suppose  that  f{x)  has  a  derivative  on 
the  interval  (a,  6)  and  is  continuous  on  the  interval  [a,b].  Then  at  some  value  c  e  (a,&), 
f{b)  -  /(a) 


f'ic)  = 


b  —  a 


f(b^     f  ( ci] 

Proof.    Let  m  =  ,  and  consider  a  new  function  g(x)  =  f(x)  —m(x  —  a)  —  f  (a) . 

b  —  a 

We  know  that  g{x)  has  a  derivative  everywhere,  since  g'{x)  =  f'{x)  —  m.  We  can  compute 
g{a)  —  /(a)  —  m(a  —  a)  —  /(a)  =  0  and 

g{b)  =  f{b)  -  m{b  -  a)  -  /(a)  =  f{b)  -  M^JM^b  -a)-  f{a) 

0  —  a 

=  fib)  -  (fib)  -  /(a))  -  /(a)  =  0. 
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So  the  height  of  g{x)  is  the  same  at  both  endpoints.  This  means,  by  RoUe's  Theorem,  that 
at  some  c,  g'{c)  =  0.  But  we  know  that  g'{c)  =  /'(c)  —  m,  so 

0  =  /'W-m  =  /'(c)-M^, 

0  —  a 

which  turns  into 

0  —  a 

exactly  what  we  want.  ■ 

Returning  to  the  original  formulation  of  question  (2),  we  see  that  if  f{t)  gives  the 
position  of  your  car  at  time  t,  then  the  Mean  Value  Theorem  says  that  at  some  time  c, 
/'(c)  =  70,  that  is,  at  some  time  you  must  have  been  traveling  at  exactly  your  average 
speed  for  the  trip,  and  that  indeed  you  exceeded  the  speed  limit. 

Now  let's  return  to  question  (1).  Suppose,  for  example,  that  two  functions  are  known  to 
have  derivative  equal  to  5  everywhere,  f'{x)  =  g'{x)  =  5.  It  is  easy  to  find  such  functions: 
5x,  5x  +  47,  5x  —  132,  etc.  Are  there  other,  more  complicated,  examples?  No — the  only 
functions  that  work  are  the  "obvious"  ones,  namely,  5x  plus  some  constant.  How  can  we 
see  that  this  is  true? 

Although  "5"  is  a  very  simple  derivative,  let's  look  at  an  even  simpler  one.  Suppose 
that  f'{x)  =  g'{x)  —  0.  Again  we  can  find  examples:  f{x)  —  0,  f{x)  —  47,  f{x)  —  —511 
all  have  f'{x)  —  0.  Are  there  non-constant  functions  /  with  derivative  0?  No,  and  here's 
why:  Suppose  that  f{x)  is  not  a  constant  function.  This  means  that  there  are  two  points 
on  the  function  with  different  heights,  say  /(a)  ^  f{h).  The  Mean  Value  Theorem  tells 
us  that  at  some  point  c,  /'(c)  =  (/(b)  —  f{a))/{b  —  a)  ^  0.  So  any  non-constant  function 
does  not  have  a  derivative  that  is  zero  everywhere;  this  is  the  same  as  saying  that  the  only 
functions  with  zero  derivative  are  the  constant  functions. 

Let's  go  back  to  the  slightly  less  easy  example:  suppose  that  f'{x)  =  g'{x)  =  5.  Then 
{f{x)  —  g{x)y  —  f'{x)  —  g'{x)  =  5  —  5  =  0.  So  using  what  we  discovered  in  the  previous 
paragraph,  we  know  that  f{x)  —  g{x)  =  k,  for  some  constant  k.  So  any  two  functions  with 
derivative  5  must  differ  by  a  constant;  since  5x  is  known  to  work,  the  only  other  examples 
must  look  like  5x  +  k. 

Now  we  can  extend  this  to  more  complicated  functions,  without  any  extra  work. 
Suppose  that  f'{x)  =  g'{x).  Then  as  before  {f{x)  —  g{x))'  =  f'{x)  —  g'{x)  =  0,  so 
f{x)  —  g{x)  =  k.  Again  this  means  that  if  we  find  just  a  single  function  g{x)  with  a  certain 
derivative,  then  every  other  function  with  the  same  derivative  must  be  of  the  form  g[x)  +  k. 

EXAMPLE  6.5.3  Describe  all  functions  that  have  derivative  5x  —  3.  It's  easy  to  find 
one:  g(x)  =  (5/2)a;^  —  3x  has  g'{x)  =  5x  —  3.  The  only  other  functions  with  the  same 
derivative  are  therefore  of  the  form  f{x)  =  {5/2)x^  —  3x  +  k. 
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Alternately,  though  not  obviously,  you  might  have  first  noticed  that  g{x)  =  (5/2) — 
3a;  +  47  has  g'{x)  =  5x  —  3.  Then  every  other  function  with  the  same  derivative  must 
have  the  form  f(x)  =  (5/2)x^  —  3x  +  47  +  k.  This  looks  different,  but  it  really  isn't. 
The  functions  of  the  form  f{x)  =  (5/2)x^  —  3x  +  /c  are  exactly  the  same  as  the  ones  of 
the  form  f(x)  =  (5/2)x^  —  3x  +  47  +  k.  For  example,  (5/2)a:^  —  3x  +  10  is  the  same  as 
(5/2)x^  —  3x +  47  + (— 37),  and  the  first  is  of  the  first  form  while  the  second  has  the  second 
form.  □ 

This  is  worth  calling  a  theorem: 

THEOREM  6.5.4  If  /'(x)  =  g'{x)  for  every  x  G  (a,  6),  then  for  some  constant  /c, 
/(x)  =  g{x)  +  k  on  the  interval  (a,  b).  m 

EXAMPLE  6.5.5  Describe  all  functions  with  derivative  sinx  +  e^.  One  such  function 
is  —  cos  X  +  e^,  so  all  such  functions  have  the  form  —  cos  x  +     +  k.  □ 

Exercises  6.5. 

1.  Let  f{x)  =  x'^ .  Find  a  value  c  G  (—1,  2)  so  that  /'(c)  equals  the  slope  between  the  endpoints 
of  f  ix)  on  [-1,2].  ^ 

2.  Verify  that  f{x)  =  x/{x  +  2)  satisfies  the  hypotheses  of  the  Mean  Value  Theorem  on  the 
interval  [1,4]  and  then  find  all  of  the  values,  c,  that  satisfy  the  conclusion  of  the  theorem. 

3.  Verify  that  /(x)  =  3a;/(x  +  7)  satisfies  the  hypotheses  of  the  Mean  Value  Theorem  on  the 
interval  [—2,  6]  and  then  find  all  of  the  values,  c,  that  satisfy  the  conclusion  of  the  theorem. 

4.  Let  /(x)  =  tanx.  Show  that  /(tt)  =  /(27r)  =  0  but  there  is  no  number  c  G  (tt,  27r)  such  that 
/'(c)  =  0.  Why  does  this  not  contradict  Rolle's  theorem? 

5.  Let  f{x)  =  {x  —  3)^^.   Show  that  there  is  no  value  c  G  (1,4)  such  that  /'(c)  =  (/(4)  — 
/(I))/ (4—  1).  Why  is  this  not  a  contradiction  of  the  Mean  Value  Theorem? 

6.  Describe  all  functions  with  derivative      +  47x  —  5. 

7.  Describe  all  functions  with  derivative  - — ^ — -. 

1  +  a;2 

3  1 

8.  Describe  all  functions  with  derivative  x  .  =^> 

x 

9.  Describe  all  functions  with  derivative  sin(2a;). 

10.  Show  that  the  equation        —  7a;  +  1  =  0  does  not  have  more  than  two  distinct  real  roots. 

11.  Let  /  be  differentiable  on  M.  Suppose  that  f'{x)  ^  0  for  every  x.  Prove  that  /  has  at  most 
one  real  root. 

12.  Prove  that  for  all  real  x  and  y  |  cos  a;  —  cosy|  <  |x  —  y|.  State  and  prove  an  analogous  result 
involving  sine. 

13.  Show  that  Vl  +  x  <  1  +  (x/2)  if  -1  <  x  <  1. 
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Integration 


7.1     Two  EXAMPLES 

Up  to  now  we  have  been  concerned  with  extracting  information  about  how  a  function 
changes  from  the  function  itself.  Given  knowledge  about  an  object's  position,  for  example, 
we  want  to  know  the  object's  speed.  Given  information  about  the  height  of  a  curve  we 
want  to  know  its  slope.  We  now  consider  problems  that  are,  whether  obviously  or  not,  the 
reverse  of  such  problems. 

EXAMPLE  7.1.1  An  object  moves  in  a  straight  line  so  that  its  speed  at  time  t  is  given 
by  v{t)  —  3t  in,  say,  cm/sec.  If  the  object  is  at  position  10  on  the  straight  line  when  t  —  0, 
where  is  the  object  at  any  time  t? 

There  are  two  reasonable  ways  to  approach  this  problem.  If  s{t)  is  the  position  of  the 
object  at  time  t,  we  know  that  s'{t)  =  v{t).  Because  of  our  knowledge  of  derivatives,  we 
know  therefore  that  s(t)  =  3t^/2  +  k,  and  because  s(0)  =  10  we  easily  discover  that  k  =  10, 
so  s{t)  =  3t^/2  +  10.  For  example,  at  t  =  1  the  object  is  at  position  3/2  +  10  =  11.5.  This 
is  certainly  the  easiest  way  to  deal  with  this  problem.  Not  all  similar  problems  are  so  easy, 
as  we  will  see;  the  second  approach  to  the  problem  is  more  difficult  but  also  more  general. 

We  start  by  considering  how  we  might  approximate  a  solution.  We  know  that  at  t  =  0 
the  object  is  at  position  10.  How  might  we  approximate  its  position  at,  say,  t  —  17  We 
know  that  the  speed  of  the  object  at  time  t  =  0  is  0;  if  its  speed  were  constant  then  in  the 
first  second  the  object  would  not  move  and  its  position  would  still  be  10  when  t  =  1.  In 
fact,  the  object  will  not  be  too  far  from  10  at  t  =  1,  but  certainly  we  can  do  better.  Let's 
look  at  the  times  0.1,  0.2,  0.3,  . . . ,  1.0,  and  try  approximating  the  location  of  the  object 
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at  each,  by  supposing  that  during  each  tenth  of  a  second  the  object  is  going  at  a  constant 
speed.  Since  the  object  initially  has  speed  0,  we  again  suppose  it  maintains  this  speed,  but 
only  for  a  tenth  of  second;  during  that  time  the  object  would  not  move.  During  the  tenth 
of  a  second  from  t  =  0.1  to  t  =  0.2,  we  suppose  that  the  object  is  traveling  at  0.3  cm/sec, 
namely,  its  actual  speed  ai  t  —  0.1.  In  this  case  the  object  would  travel  (0.3)  (0.1)  =  0.03 
centimeters:  0.3  cm/sec  times  0.1  seconds.  Similarly,  between  t  =  0.2  and  t  =  0.3  the 
object  would  travel  (0.6)  (0.1)  =  0.06  centimeters.  Continuing,  we  get  as  an  approximation 
that  the  object  travels 

(0.0)(0.1)  +  (0.3)(0.1)  +  (0.6)(0.1)  +  •  ■  •  +  (2.7)(0.1)  =  1.35 

centimeters,  ending  up  at  position  11.35.  This  is  a  better  approximation  than  10,  certainly, 
but  is  still  just  an  approximation.  (We  know  in  fact  that  the  object  ends  up  at  position 
11.5,  because  we've  already  done  the  problem  using  the  first  approach.)  Presumably, 
we  will  get  a  better  approximation  if  we  divide  the  time  into  one  hundred  intervals  of  a 
hundredth  of  a  second  each,  and  repeat  the  process: 

(0.0)(0.01)  +  (0.03)(0.01)  +  (0.06)(0.01)  +  ■  ■  ■  +  (2.97)(0.01)  =  1.485. 

We  thus  approximate  the  position  as  11.485.  Since  we  know  the  exact  answer,  we  can  see 
that  this  is  much  closer,  but  if  we  did  not  already  know  the  answer,  we  wouldn't  really 
know  how  close. 

We  can  keep  this  up,  but  we'll  never  really  know  the  exact  answer  if  we  simply  compute 
more  and  more  examples.  Let's  instead  look  at  a  "typical"  approximation.  Suppose  we 
divide  the  time  into  n  equal  intervals,  and  imagine  that  on  each  of  these  the  object  travels 
at  a  constant  speed.  Over  the  first  time  interval  we  approximate  the  distance  traveled 
as  (0.0)  (1/n)  =  0,  as  before.  During  the  second  time  interval,  from  t  =  l/ntot  =  2/n, 
the  object  travels  approximately  3(l/n)(l/n)  =  3/n^  centimeters.  During  time  interval 
number  i,  the  object  travels  approximately  (3(i  —  l)/n)(l/n)  =  3{i  —  l)/n^  centimeters, 
that  is,  its  speed  at  time  (z  —  l)/n,  3(z  —  l)/n,  times  the  length  of  time  interval  number  i, 
1/n.  Adding  these  up  as  before,  we  approximate  the  distance  traveled  as 

(0)-  +  3^  +  3(2)^  +  3(3)^  +  •  •  •  +  3(n  -  1)^ 

centimeters.  What  can  we  say  about  this?  At  first  it  looks  rather  less  useful  than  the 
concrete  calculations  we've  already  done.  But  in  fact  a  bit  of  algebra  reveals  it  to  be  much 
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more  useful.  We  can  factor  out  a  3  and  1/n^  to  get 

^(0  +  l  +  2  +  3  +  ---  +  (n-l)), 

that  is,  times  the  sum  of  the  first  n  —  1  positive  integers.  Now  we  make  use  of  a  fact 
you  may  have  run  across  before: 

,      k(k  +  l) 
l  +  2  +  3  +  ---  +  /c=    ^  ^  \ 

In  our  case  we're  interested  in  k  —  n  —  1,  so 

1  +  2  +  3  +  •  •  •  +  (n  -  1)  =  ^  2^  =  • 

This  simphfies  the  approximate  distance  traveled  to 


2         2  2  J      2  \  n 

Now  this  is  quite  easy  to  understand:  as  n  gets  larger  and  larger  this  approximation  gets 
closer  and  closer  to  (3/2)(l  —  0)  =  3/2,  so  that  3/2  is  the  exact  distance  traveled  during 
one  second,  and  the  final  position  is  11.5. 

So  for  t  =  1,  at  least,  this  rather  cumbersome  approach  gives  the  same  answer  as 
the  first  approach.  But  really  there's  nothing  special  about  t  =  1;  let's  just  call  it  t 
instead.  In  this  case  the  approximate  distance  traveled  during  time  interval  number  i  is 
3{i  —  l){t/n){t/n)  =  3{i  —  l)t^ /n^,  that  is,  speed  3(z  —  l){t/n)  times  time  t/n,  and  the 
total  distance  traveled  is  approximately 

(0)-  +  3(1)4  +  3(2)4  +  3(3)4  +  •  •  •  +  3(n  -  1)4- 
As  before  we  can  simplify  this  to 

3^2  3^2  ^2  -  n      3  2  /  1 

In  the  limit,  as  n  gets  larger,  this  gets  closer  and  closer  to  (3/2)^^  and  the  approximated 
position  of  the  object  gets  closer  and  closer  to  (3/2)^^  +  10,  so  the  actual  position  is 
(3/2)^2  +  10,  exactly  the  answer  given  by  the  first  approach  to  the  problem.  □ 


EXAMPLE  7.1.2  Find  the  area  under  the  curve  y  =  3x  between  x  =  0  and  any  positive 
value  X.  There  is  here  no  obvious  analogue  to  the  first  approach  in  the  previous  example. 
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but  the  second  approach  works  fine.  (Because  the  function  y  =  3x  is  so  simple,  there 
is  another  approach  that  works  here,  but  it  is  even  more  Umited  in  potential  application 
than  is  approach  number  one.)  How  might  we  approximate  the  desired  area?  We  know 
how  to  compute  areas  of  rectangles,  so  we  approximate  the  area  by  rectangles.  Jumping 
straight  to  the  general  case,  suppose  we  divide  the  interval  between  0  and  x  into  n  equal 
sub  intervals,  and  use  a  rectangle  above  each  sub  interval  to  approximate  the  area  under 
the  curve.  There  are  many  ways  we  might  do  this,  but  let's  use  the  height  of  the  curve 
at  the  left  endpoint  of  the  subinterval  as  the  height  of  the  rectangle,  as  in  figure  7.1.1. 
The  height  of  rectangle  number  i  is  then  3{i  —  l){x/n),  the  width  is  x/n,  and  the  area  is 
3(z  —  l)(x^/n^).  The  total  area  of  the  rectangles  is 

2  2  2  2 

(0)-  +  3(1)^  +  3(2):^^  +  3(3)^  +  •  •  •  +  3(n  -  1)^. 

n  n  n  n  n 

By  factoring  out  "ix^  jr?  this  simplifies  to 

.     „  ,       ...      3x2      -  n         ^  ( ^  1 


(0  +  1  +  2  +  ■  ■  ■  +  (n  -  1))  =  -^^^  =  -x'\\-- 


2         2      \  n 

As  n  gets  larger  this  gets  closer  and  closer  to  3x^/2,  which  must  therefore  be  the  true  area 
under  the  curve.  □ 


Figure  7.1.1     Approximating  the  area  under  y  =  3x  with  rectangles.  Drag  the  slider  to 
change  the  number  of  rectangles. 

What  you  will  have  noticed,  of  course,  is  that  while  the  problem  in  the  second  example 
appears  to  be  much  different  than  the  problem  in  the  first  example,  and  while  the  easy 
approach  to  problem  one  does  not  appear  to  apply  to  problem  two,  the  "approximation" 
approach  works  in  both,  and  moreover  the  calculations  are  identical.  As  we  will  see,  there 
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are  many,  many  problems  that  appear  much  different  on  the  surface  but  that  turn  out  to 
be  the  same  as  these  problems,  in  the  sense  that  when  we  try  to  approximate  solutions  we 
end  up  with  mathematics  that  looks  like  the  two  examples,  though  of  course  the  function 
involved  will  not  always  be  so  simple. 

Even  better,  we  now  see  that  while  the  second  problem  did  not  appear  to  be  amenable 
to  approach  one,  it  can  in  fact  be  solved  in  the  same  way.  The  reasoning  is  this:  we  know 
that  problem  one  can  be  solved  easily  by  finding  a  function  whose  derivative  is  3t.  We 
also  know  that  mathematically  the  two  problems  are  the  same,  because  both  can  be  solved 
by  taking  a  limit  of  a  sum,  and  the  sums  are  identical.  Therefore,  we  don't  really  need 
to  compute  the  limit  of  either  sum  because  we  know  that  we  will  get  the  same  answer  by 
computing  a  function  with  the  derivative  3t  or,  which  is  the  same  thing,  3a;. 

It's  true  that  the  first  problem  had  the  added  complication  of  the  "10" ,  and  we  certainly 
need  to  be  able  to  deal  with  such  minor  variations,  but  that  turns  out  to  be  quite  simple. 
The  lesson  then  is  this:  whenever  we  can  solve  a  problem  by  taking  the  limit  of  a  sum  of 
a  certain  form,  we  can  instead  of  computing  the  (often  nasty)  limit  find  a  new  function 
with  a  certain  derivative. 

Exercises  1.1. 

1.  Suppose  an  object  moves  in  a  straight  line  so  that  its  speed  at  time  t  is  given  by  v(t)  =  2t  +  2, 
and  that  aX  t  =  1  the  object  is  at  position  5.  Find  the  position  of  the  object  at  t  =  2. 

2.  Suppose  an  object  moves  in  a  straight  Une  so  that  its  speed  at  time  t  is  given  by  v{i)  =  t'^  +2, 
and  that  at  t  =  0  the  object  is  at  position  5.  Find  the  position  of  the  object  at  t  =  2.  =^ 

3.  By  a  method  similar  to  that  in  example  7.1.2,  find  the  area  under  y  =  2x  between  x  =  0  and 
any  positive  value  for  x.  =^ 

4.  By  a  method  similar  to  that  in  example  7.1.2,  find  the  area  under  y  =  Ax  between  x  =  0  and 
any  positive  value  for  x.  =^> 

5.  By  a  method  similar  to  that  in  example  7.1.2,  find  the  area  under  y  =  4x  between  x  =  2  and 
any  positive  value  for  x  bigger  than  2.  =^> 

6.  By  a  method  similar  to  that  in  example  7.1.2,  find  the  area  under  y  =  Ax  between  any  two 
positive  values  for  x,  say  a  <  6.  =^ 

7.  Let  f{x)  =  x'^  +  3x  +  2.  Approximate  the  area  under  the  curve  between  x  =  0  and  x  =  2 
using  4  rectangles  and  also  using  8  rectangles.  =^ 

8.  Let  f{x)  =  x^  —  2x  +  3.  Approximate  the  area  under  the  curve  between  x  =  1  and  a;  =  3 
using  4  rectangles. 

7.2    The  Fundamental  Theorem  of  Calculus 

Let's  recast  the  first  example  from  the  previous  section.  Suppose  that  the  speed  of  the 
object  is  3t  at  time  t.  How  far  does  the  object  travel  between  time  t  =  a  and  time  t  =  b7 
We  are  no  longer  assuming  that  we  know  where  the  object  is  at  time  t  =  0  or  at  any  other 
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time.  It  is  certainly  true  that  it  is  somewhere,  so  let's  suppose  that  at  t  =  0  the  position  is  k. 
Then  just  as  in  the  example,  we  know  that  the  position  of  the  object  at  any  time  is  3t^/2+k. 
This  means  that  at  time  t  =  a  the  position  is  3a^/2  +  k  and  at  time  t  =  b  the  position  is 
36^2  +  k.  Therefore  the  change  in  position  is  3&V2  +  k-  {3a^/2  +  k)  =  3b'^/2  -  3a^/2. 
Notice  that  the  k  drops  out;  this  means  that  it  doesn't  matter  that  we  don't  know  k,  it 
doesn't  even  matter  if  we  use  the  wrong  k,  we  get  the  correct  answer.  In  other  words,  to 
find  the  change  in  position  between  time  a  and  time  b  we  can  use  any  antiderivative  of  the 
speed  function  3t — it  need  not  be  the  one  antiderivative  that  actually  gives  the  location  of 
the  object. 

What  about  the  second  approach  to  this  problem,  in  the  new  form?  We  now  want  to 
approximate  the  change  in  position  between  time  a  and  time  b.  We  take  the  interval  of 
time  between  a  and  b,  divide  it  into  n  subintervals,  and  approximate  the  distance  traveled 
during  each.  The  starting  time  of  subinterval  number  i  is  now  a  +  {i  —  l){b  —  a) /n,  which 
we  abbreviate  as  U^i,  so  that  to  =  a,  ti  =  a  +  (b  —  a)/n,  and  so  on.  The  speed  of  the 
object  is  f{t)  =  3t,  and  each  subinterval  is  (6  —  a)/n  =  At  seconds  long.  The  distance 
traveled  during  subinterval  number  i  is  approximately  f{ti-i)At,  and  the  total  change  in 
distance  is  approximately 

f{tQ)At  +  f{t,)At  +■■■  +  f{tn-l)At. 

The  exact  change  in  position  is  the  limit  of  this  sum  as  n  goes  to  infinity.  We  abbreviate 
this  sum  using  sigma  notation: 

n-1 

^  f{ti)At  =  f{to)At  +  f{ti)At  +  ■■■  +  f{tn-i)At. 

The  notation  on  the  left  side  of  the  equal  sign  uses  a  large  capital  sigma,  a  Greek  letter, 
and  the  left  side  is  an  abbreviation  for  the  right  side.  The  answer  we  seek  is 

n-1 

lim  y^f{ti)At. 

i=0 

Since  this  must  be  the  same  as  the  answer  we  have  already  obtained,  we  know  that 

hm  Y,f{t,)At=-  -. 

i=0 

The  significance  of  3t^ /2,  into  which  we  substitute  t  =  b  and  t  =  a,  is  of  course  that  it  is 
a  function  whose  derivative  is  f{t).  As  we  have  discussed,  by  the  time  we  know  that  we 
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want  to  compute 

n-1 

lim  V/(ti)At, 

n. — ^rv-i  '  * 


n—^oo 

i=0 


it  no  longer  matters  what  f{t)  stands  for — it  could  be  a  speed,  or  the  height  of  a  curve, 
or  something  else  entirely.    We  know  that  the  limit  can  be  computed  by  finding  any 
function  with  derivative  fit),  substituting  a  and  6,  and  subtracting.  We  summarize  this 
in  a  theorem.  First,  we  introduce  some  new  notation  and  terms. 
We  write 

/   f{t)dt=  lim  y^f{U)At 

if  the  limit  exists.  That  is,  the  left  hand  side  means,  or  is  an  abbreviation  for,  the  right 
hand  side.  The  symbol  J  is  called  an  integral  sign,  and  the  whole  expression  is  read 
as  "the  integral  of  f{t)  from  a  to  6."  What  we  have  learned  is  that  this  integral  can  be 
computed  by  finding  a  function,  say  F{t),  with  the  property  that  F'{t)  =  f{t),  and  then 
computing  F{b)  —  F{a).  The  function  F{t)  is  called  an  antiderivative  of  f{t).  Now  the 
theorem: 

THEOREM  7.2.1  Fundamental  Theorem  of  Calculus  Suppose  that  f{x)  is 
continuous  on  the  interval  [a,b].  If  F{x)  is  any  antiderivative  of  f{x),  then 


/ 

J  a 


b 

f{x)dx  =  F{b)  -  F{a). 


□ 


Let's  rewrite  this  slightly: 


[  f{t)dt^F{x)-F{a). 

J  a 


We've  replaced  the  variable  xhj  t  and  h  by  x.  These  are  just  different  names  for  quantities, 
so  the  substitution  doesn't  change  the  meaning.  It  does  make  it  easier  to  think  of  the  two 
sides  of  the  equation  as  functions.  The  expression 


J  a 


f{t)dt 


is  a  function:  plug  in  a  value  for  x,  get  out  some  other  value.  The  expression  F{x)  —  F{a) 
is  of  course  also  a  function,  and  it  has  a  nice  property: 

^{F{x)-F{a))  =  F'{x)  =  f{x), 
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since  F{a)  is  a  constant  and  has  derivative  zero.  In  other  words,  by  shifting  our  point  of 
view  shghtly,  we  see  that  the  odd  looking  function 


has  a  derivative,  and  that  in  fact  G'{x)  =  f{x).  This  is  reaUy  just  a  restatement  of  the 
Fundamental  Theorem  of  Calculus,  and  indeed  is  often  called  the  Fundamental  Theorem 
of  Calculus.  To  avoid  confusion,  some  people  call  the  two  versions  of  the  theorem  "The 
Fundamental  Theorem  of  Calculus,  part  I"  and  "The  Fundamental  Theorem  of  Calculus, 
part  II" ,  although  unfortunately  there  is  no  universal  agreement  as  to  which  is  part  I  and 
which  part  II.  Since  it  really  is  the  same  theorem,  differently  stated,  some  people  simply 
call  them  both  "The  Fundamental  Theorem  of  Calculus." 

THEOREM  7.2.2  Fundamental  Theorem  of  Calculus  Suppose  that  f{x)  is 
continuous  on  the  interval  [a,  b]  and  let 


We  have  not  really  proved  the  Fundamental  Theorem.  In  a  nutshell,  we  gave  the 
following  argument  to  justify  it:  Suppose  we  want  to  know  the  value  of 


We  can  interpret  the  right  hand  side  as  the  distance  traveled  by  an  object  whose  speed 
is  given  by  f{t).  We  know  another  way  to  compute  the  answer  to  such  a  problem:  find 
the  position  of  the  object  by  finding  an  antiderivative  of  /(t),  then  substitute  t  =  a  and 
t  —  b  and  subtract  to  find  the  distance  traveled.  This  must  be  the  answer  to  the  original 
problem  as  well,  even  if  f{t)  does  not  represent  a  speed. 

What's  wrong  with  this?  In  some  sense,  nothing.  As  a  practical  matter  it  is  a  very 
convincing  argument,  because  our  understanding  of  the  relationship  between  speed  and 
distance  seems  to  be  quite  solid.  From  the  point  of  view  of  mathematics,  however,  it 
is  unsatisfactory  to  justify  a  purely  mathematical  relationship  by  appeaUng  to  our  un- 
derstanding of  the  physical  universe,  which  could,  however  unlikely  it  is  in  this  case,  be 
wrong. 

A  complete  proof  is  a  bit  too  involved  to  include  here,  but  we  will  indicate  how  it  goes. 
First,  if  we  can  prove  the  second  version  of  the  Fundamental  Theorem,  theorem  7.2.2,  then 
we  can  prove  the  first  version  from  that: 


Then  G'{x)  =  f{x). 


□ 


b 


n-1 
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Proof  of  Theorem  7.2.1.      We  know  from  theorem  7.2.2  that 

G{x)  =  [  fit)  dt 


is  an  antiderivative  of  /(x),  and  therefore  any  antiderivative  F{x)  of  f{x)  is  of  the  form 
F{x)  =  G{x)  +  k.  Then 

F{b)  -  F{a)  =  G{b)  +k-  {G{a)  +  k)  =  G{h)  -  G{a) 

f(t)dt-    /  f{t)dt. 


r 

It  is  not  hard  to  see  that  /    f{t)  dt  =  0,  so  this  means  that 

J  a 

Fib)-F{a)=  f  f{t)dt, 

J  a 

which  is  exactly  what  theorem  7.2.1  says.  ■ 

So  the  real  job  is  to  prove  theorem  7.2.2.  We  will  sketch  the  proof,  using  some  facts 
that  we  do  not  prove.  First,  the  following  identity  is  true  of  integrals: 

l*h  i*c  pb 

/    f{t)dt=    /    f{t)dt+    /  f{t)dt. 
J  a  J  a  J  c 

This  can  be  proved  directly  from  the  definition  of  the  integral,  that  is,  using  the  limits 
of  sums.  It  is  quite  easy  to  see  that  it  must  be  true  by  thinking  of  either  of  the  two 
applications  of  integrals  that  we  have  seen.  It  turns  out  that  the  identity  is  true  no  matter 
what  c  is,  but  it  is  easiest  to  think  about  the  meaning  when  a  <  c  <  b. 

First,  if  f{t)  represents  a  speed,  then  we  know  that  the  three  integrals  represent  the 
distance  traveled  between  time  a  and  time  b;  the  distance  traveled  between  time  a  and 
time  c;  and  the  distance  traveled  between  time  c  and  time  b.  Clearly  the  sum  of  the  latter 
two  is  equal  to  the  first  of  these. 

Second,  if  f{t)  represents  the  height  of  a  curve,  the  three  integrals  represent  the  area 
under  the  curve  between  a  and  b;  the  area  under  the  curve  between  a  and  c;  and  the  area 
under  the  curve  between  c  and  b.  Again  it  is  clear  from  the  geometry  that  the  first  is  equal 
to  the  sum  of  the  second  and  third. 
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Proof  sketch  for  Theorem  7.2.2.  We  want  to  compute  G'{x),  so  we  start  with  the 
definition  of  the  derivative  in  terms  of  a  hmit: 

G'W=  lim  G(:.  +  A.)-G(x) 


Ax 


1 


r"X+ Ax 


=  hm 

Ax 


r>X  + Ax 

Ax"-^0  Ax 

Now  we  need  to  know  something  about 

r>X  +  Ax 


/ 


f{t)dt 


when  Ax  is  smaU;  in  fact,  it  is  very  close  to  Axf{x),  but  we  wiU  not  prove  this.  Once 
again,  it  is  easy  to  beheve  this  is  true  by  thinking  of  our  two  apphcations:  The  integral 

/■x+Ax 

/  f{t)dt 

J  X 

can  be  interpreted  as  the  distance  traveled  by  an  object  over  a  very  short  interval  of  time. 
Over  a  sufficiently  short  period  of  time,  the  speed  of  the  object  will  not  change  very  much, 
so  the  distance  traveled  will  be  approximately  the  length  of  time  multiplied  by  the  speed  at 
the  beginning  of  the  interval,  namely,  Axf{x).  Alternately,  the  integral  may  be  interpreted 
as  the  area  under  the  curve  between  x  and  x  +  Ax.  When  Ax  is  very  small,  this  will  be 
very  close  to  the  area  of  the  rectangle  with  base  Ax  and  height  /(x);  again  this  is  Ax/(x). 
If  we  accept  this,  we  may  proceed: 

1      r^^"        ^   ,  V        Ax/(X)  , 

lim  —  /         f(t)dt=  lim  — =  /  x  , 
Ax^oAxJ^  Ax^O     Ax         ^  ^ 

which  is  what  we  wanted  to  show.  ■ 

It  is  still  true  that  we  are  depending  on  an  interpretation  of  the  integral  to  justify  the 
argument,  but  we  have  isolated  this  part  of  the  argument  into  two  facts  that  are  not  too 
hard  to  prove.  Once  the  last  reference  to  interpretation  has  been  removed  from  the  proofs 
of  these  facts,  we  will  have  a  real  proof  of  the  Fundamental  Theorem. 
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Now  we  know  that  to  solve  certain  kinds  of  problems,  those  that  lead  to  a  sum  of  a 
certain  form,  we  "merely"  find  an  antiderivative  and  substitute  two  values  and  subtract. 
Unfortunately,  finding  antiderivatives  can  be  quite  difficult.  While  there  are  a  small  number 
of  rules  that  allow  us  to  compute  the  derivative  of  any  common  function,  there  arc  no  such 
rules  for  antiderivatives.  There  are  some  techniques  that  frequently  prove  useful,  but  we 
will  never  be  able  to  reduce  the  problem  to  a  completely  mechanical  process. 

Because  of  the  close  relationship  between  an  integral  and  an  antiderivative,  the  integral 
sign  is  also  used  to  mean  "antiderivative" .  You  can  tell  which  is  intended  by  whether  the 
limits  of  integration  are  included: 

dx 


i: 


is  an  ordinary  integral,  also  called  a  definite  integral,  because  it  has  a  definite  value, 
namely 

/   x^  dx  =  —  TT  =  t:- 

Ji  3      3  3 

We  use 

x^  dx 


to  denote  the  antiderivative  of  x  ,  also  called  an  indefinite  integral.  So  this  is  evaluated 
as 


X 


3 


X''  dx  =  Y  +  c. 

It  is  customary  to  include  the  constant  C  to  indicate  that  there  are  really  an  infinite 
number  of  antiderivatives.  We  do  not  need  this  C  to  compute  definite  integrals,  but  in 
other  circumstances  we  will  need  to  remember  that  the  C  is  there,  so  it  is  best  to  get 
into  the  habit  of  writing  the  C.  When  we  compute  a  definite  integral,  we  first  find  an 
antiderivative  and  then  substitute.  It  is  convenient  to  first  display  the  antiderivative  and 
then  do  the  substitution;  we  need  a  notation  indicating  that  the  substitution  is  yet  to  be 
done.  A  typical  solution  would  look  like  this: 


x^ 

x^  dx  =  — 


23      i3  ^  7 

y  ~  y  ^  3' 


The  vertical  line  with  subscript  and  superscript  is  used  to  indicate  the  operation  "substitute 
and  subtract"  that  is  needed  to  finish  the  evaluation. 
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Exercises  7.2. 

Find  the  antiderivatives  of  the  functions: 

1.     8^/x  ^ 

3. 

5.    7s"^  ^ 
7.    {x  -  6f  =^ 

9.  ' 


2.  3r  + 1  ^ 

4.  ^ 

6.    (5x  +  l)^ 


3/2 


8.  X 


10.    |2t-4|  ^ 


Compute  the  values  of  the  integrals: 


11 


■i: 


t  +  3t  dt 


13. 
15. 


10 


dx 


x^  dx 


12. 


14. 


16. 


17.  Find  the  derivative  of  G{x 

18.  Find  the  derivative  of  G{x 

19.  Find  the  derivative  of  G{x 

20.  Find  the  derivative  of  G{x 

21.  Find  the  derivative  of  G{x 

22.  Find  the  derivative  of  G{x 


i: 


r  -  3t  dt 


t  -  3t  dt 


e  dt 


e  dt 


tan(t  )  dt 


i: 

tan(i^ 


sin  t  dt 


dx 


x^  dx 


7.3    Some  Properties  of  Integrals 

Suppose  an  object  moves  so  that  its  speed,  or  more  properly  velocity,  is  given  by  v{t)  = 
—t^  +  5t,  as  shown  in  figure  7.3.1.  Let's  examine  the  motion  of  this  object  carefully. 
We  know  that  the  velocity  is  the  derivative  of  position,  so  position  is  given  by  s{t)  = 
+  5t^/2  +  C.  Let's  suppose  that  at  time  t  =  0  the  object  is  at  position  0,  so 
s{t)  =  +  5t^/2;  this  function  is  also  pictured  in  figure  7.3.1. 

Between  t  =  0  and  t  =  5  the  velocity  is  positive,  so  the  object  moves  away  from  the 
starting  point,  until  it  is  a  bit  past  position  20.  Then  the  velocity  becomes  negative  and 
the  object  moves  back  toward  its  starting  point.  The  position  of  the  object  at  t  =  5  is 
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Figure  7.3.1     The  velocity  of  an  object  and  its  position.  You  can  drag  the  blue  dot  and 
the  red  dot  will  follow. 

exactly  s(5)  =  125/6,  and  at  t  =  6  it  is  s(6)  =  18.  The  total  distance  traveled  by  the 
object  is  therefore  125/6  +  (125/6  -  18)  =  71/3  23.7. 

As  we  have  seen,  we  can  also  compute  distance  traveled  with  an  integral;  let's  try  it. 


/   v{t)dt  = 
Jo  Jo 


-t'^  +  5tdt  = 


=  18. 


What  went  wrong?  Well,  nothing  really,  except  that  it's  not  really  true  after  all  that  "we 
can  also  compute  distance  traveled  with  an  integral" .  Instead,  as  you  might  guess  from  this 
example,  the  integral  actually  computes  the  net  distance  traveled,  that  is,  the  difference 
between  the  starting  and  ending  point. 
As  we  have  already  seen. 


v{t)dt+  /  v{t)dt 


v{t)dt 


Computing  the  two  integrals  on  the  right  (do  it!)  gives  125/6  and  —17/6,  and  the  sum  of 
these  is  indeed  18.  But  what  does  that  negative  sign  mean?  It  means  precisely  what  you 
might  think:  it  means  that  the  object  moves  backwards.  To  get  the  total  distance  traveled 
we  can  add  125/6  +  17/6  =  71/3,  the  same  answer  we  got  before. 

Remember  that  we  can  also  interpret  an  integral  as  measuring  an  area,  but  now  we 
see  that  this  too  is  a  little  more  complicated  that  we  have  suspected.  The  area  under  the 
curve  v{t)  from  0  to  5  is  given  by 


/  v{t)dt  = 
Jo 


125 

"6"^ 


and  the  "area"  from  5  to  6  is 


6  -^j 

v{t)dt  =  -  — 
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In  other  words,  the  area  between  the  x-axis  and  the  curve,  but  under  the  a;-axis,  "counts 
as  negative  area".  So  the  integral 

f  v{t)  dt  =  18 
Jo 

measures  "net  area",  the  area  above  the  axis  minus  the  (positive)  area  below  the  axis. 

If  we  recall  that  the  integral  is  the  limit  of  a  certain  kind  of  sum,  this  behavior  is  not 
surprising.  Recall  the  sort  of  sum  involved: 


i=0 

In  each  term  v{t)At  the  At  is  positive,  but  if  v{ti)  is  negative  then  the  term  is  negative.  If 
over  an  entire  interval,  like  5  to  6,  the  function  is  always  negative,  then  the  entire  sum  is 
negative.  In  terms  of  area,  v{t)At  is  then  a  negative  height  times  a  positive  width,  giving 
a  negative  rectangle  " 

So  now  we  see  that  when  evaluating 

6  -^j 

v{t)dt  =  -  — 

by  finding  an  antiderivative,  substituting,  and  subtracting,  we  get  a  surprising  answer,  but 
one  that  turns  out  to  make  sense. 

Let's  now  try  something  a  bit  different: 


5^     5,2     -6^     5„o  17 


+  7:5^  ^  = 


g       3       2  3       2  6 


Here  we  simply  interchanged  the  limits  5  and  6,  so  of  course  when  we  substitute  and 
subtract  we're  subtracting  in  the  opposite  order  and  we  end  up  multiplying  the  answer 
by  —1.  This  too  makes  sense  in  terms  of  the  underlying  sum,  though  it  takes  a  bit  more 
thought.  Recall  that  in  the  sum 


n-1 


i=0 


the  At  is  the  "length"  of  each  little  subinterval,  but  more  precisely  we  could  say  that 
At  =  ti+i  —  ti,  the  difference  between  two  endpoints  of  a  subinterval.  We  have  until  now 
assumed  that  we  were  working  left  to  right,  but  could  as  well  number  the  subintervals  from 
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right  to  left,  so  that  to  =  b  and  t„  =  a.  Then  At  =  tj+i  —  ti  is  negative  and  in 


„5  n-1 
•^6  i=o 


the  values  t'(ti)  are  negative  but  also  At  is  negative,  so  all  terms  are  positive  again.  On 
the  other  hand,  in 

fO  n-l 


/  v{t)  dt  =  Y^  v{ti)At, 


i=0 

the  values  v{ti)  are  positive  but  At  is  negative, and  we  get  a  negative  result: 

0  -.3 


v{t)dt 


 h  -t 

3  2 


3  2 


125 


6 


Finally  we  note  one  simple  property  of  integrals: 


/   f{x)  +  gi^)  dx  —      f{x)  dx  +      g{x)  dx. 

J  a  J  a  J  a 

This  is  easy  to  understand  once  you  recall  that  {F{x)  +  G{x))'  =  F'{x)  +  G'{x).  Hence,  if 
F'{x)  =  f{x)  and  G'{x)  =  g{x),  then 


/  f{x)+g{x)dx=  {F{x)  +  G{x))\i 

J  a 

=  F{b)  +  G{b)  -  F{a)  -  G{a) 
=  F{b)  -  F{a)  +  G{b)  -  G{a) 

=  F{x)t+G{x)t 

pb  pb 

=  /   f{x)  dx  +      g{x)  dx. 

J  a  J  a 

In  summary,  we  will  frequently  use  these  properties  of  integrals: 

f{x)  dx 


pc  pb 

i   f{x)dx+  /  f{x)dx 

a  J  c 

pb  pb  pb 

I   f{x)+g{x)dx—  /   f{x)dx+  /   g{x)  dx 

J  a  J  a  J  a 


b  pa 

f{x)dx  —  —  /  f{x)dx 
Jb 
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and  if  a  <  6  and  /(x)  <  0  on  [a,  b]  then 


f{x)dx  <  0 


and  in  fact 


b  pb 
f{x)dx  =  —  /  \f{x)\dx. 


Exercises  7.3. 

1.  An  object  moves  so  that  its  velocity  at  time  t  is  v{t)  =  —9.8t  +  20  m/s.  Describe  the  motion 
of  the  object  between  t  =  0  and  t  =  5,  find  the  total  distance  traveled  by  the  object  during 
that  time,  and  find  the  net  distance  traveled. 

2.  An  object  moves  so  that  its  velocity  at  time  t  is  v{t)  =  sini.  Set  up  and  evaluate  a  single 
definite  integral  to  compute  the  net  distance  traveled  between  t  =  0  and  t  =  2tt.  =^ 

3.  An  object  moves  so  that  its  velocity  at  time  t  is  v{t)  =  1  +  2  sini  m/s.  Find  the  net  distance 
traveled  by  the  object  between  t  =  0  and  t  =  27r,  and  find  the  total  distance  traveled  during 
the  same  period. 

4.  Consider  the  function  f{x)  =  (x  +  2)(x  +  —  l)(a;  —  2)  on  [—2,2].  Find  the  total  area 
between  the  curve  and  the  x-axis  (measuring  all  area  as  positive).  =^ 

5.  Consider  the  function  f{x)  =  —  3a;  +  2  on  [0,4].  Find  the  total  area  between  the  curve 
and  the  x-axis  (measuring  all  area  as  positive).  =^ 

6.  Evaluate  the  three  integrals: 

p3  i>A  r3 

A=      (-x^  +  9)  dx       B  =      (-x^  +  9)dx       C  =      (-x^  +  9)  dx, 
Jo  Jo  Ji 

and  verify  that  A  =  B  +  C. 


8 

Techniques  of  Integration 


Over  the  next  few  sections  we  examine  some  techniques  that  are  frequently  successful  when 
seeking  antiderivatives  of  functions.  Sometimes  this  is  a  simple  problem,  since  it  will  be 
apparent  that  the  function  you  wish  to  integrate  is  a  derivative  in  some  straightforward 
way.  For  example,  faced  with 

x^^  dx 


we  realize  immediately  that  the  derivative  of  x^^  will  supply  an  x^'^:  {x^^)'  =  llx^'^.  We 
don't  want  the  "11",  but  constants  are  easy  to  alter,  because  differentiation  "ignores"  them 
in  certain  circumstances,  so 

From  our  knowledge  of  derivatives,  we  can  immediately  write  down  a  number  of  an- 
tiderivatives. Here  is  a  list  of  those  most  often  used: 


/ 


x''dx=  +  C,  ifny^-1 

n  +  1 


I 


X  ^  dx  =  hi\x\  +  C 
dx  =  e'  +  C 
sin  xdx  =  —  cos  x  -\-  C 
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COS  xdx  =  sin  x  +  C 
sec^  xdx  =  tan x  +  C 
sec  X  tan  xdx  =  sec  a;  +  C 
dx  =  arctan  x  +  C 

==  dx  =  arcsin  x  +  C 

x-^ 


I 
I 

I  ^ 


8.1  Substitution 

Needless  to  say,  most  problems  we  encounter  will  not  be  so  simple.  Here's  a  slightly  more 
complicated  example:  find 

J  2x  cos(a;^)  dx. 

This  is  not  a  "simple"  derivative,  but  a  little  thought  reveals  that  it  must  have  come  from 
an  application  of  the  chain  rule.  Multiplied  on  the  "outside"  is  2x,  which  is  the  derivative 
of  the  "inside"  function  x^.  Checking: 

—  sin(a;^)  =  cos(x^)—x'^  =  2a;cos(a;^), 
dx  dx 


J  2x  cos(a;^)  dx  =  sin(a;^)  +  C. 

Even  when  the  chain  rule  has  "produced"  a  certain  derivative,  it  is  not  always  easy  to 
see.  Consider  this  problem: 

There  are  two  factors  in  this  expression,  x^  and  a/I  —  x'^,  but  it  is  not  apparent  that  the 
chain  rule  is  involved.  Some  clever  rearrangement  reveals  that  it  is: 

/.3yr^..  =  /(-2.)(-i)(i-(i-.^))yr^... 

This  looks  messy,  but  we  do  now  have  something  that  looks  like  the  result  of  the  chain 
rule:  the  function  1  —  x^  has  been  substituted  into  — (1/2)(1  —  x)y/x,  and  the  derivative 
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of  1  multiplied  on  the  outside.  If  we  can  find  a  function  F{x)  whose  derivative 

is  — (1/2)(1  —  x)y^  we'll  be  done,  since  then 


d 

dx 


F{1  -  x^)  =  -2xF'{l  -  x^)  =  (-2x)  1^-0  (1  -  (1  -  x'^))^/l-x^ 


But  this  isn't  hard: 


J -^{1  -  x)^/x  dx  =  j -^{x^/^  -x^/^)dx  (8.1.1) 


So  finally  we  have 

J  x'Vl^dx  =  Q(l  -  x^)  -  0  (1  -  x^f/^  +  C. 

So  we  succeeded,  but  it  required  a  clever  first  step,  rewriting  the  original  function  so 
that  it  looked  like  the  result  of  using  the  chain  rule.  Fortunately,  there  is  a  technique  that 
makes  such  problems  simpler,  without  requiring  cleverness  to  rewrite  a  function  in  just  the 
right  way.  It  does  sometimes  not  work,  or  may  require  more  than  one  attempt,  but  the 
idea  is  simple:  guess  at  the  most  likely  candidate  for  the  "inside  function" ,  then  do  some 
algebra  to  see  what  this  requires  the  rest  of  the  function  to  look  like. 

One  frequently  good  guess  is  any  complicated  expression  inside  a  square  root,  so  we 
start  by  trying  u  =  1  —  x^,  using  a  new  variable,  m,  for  convenience  in  the  manipulations 
that  follow.  Now  we  know  that  the  chain  rule  will  multiply  by  the  derivative  of  this  inner 
function: 

du 

—  =  -2x, 
dx 

so  we  need  to  rewrite  the  original  function  to  include  this: 

/   r   2x  /"  x'^  du 

x^y\  —  x'^=  /  x^  Ju  dx  =  /  — Ju  —  dx. 

-2x  -2^  dx 


Recall  that  one  benefit  of  the  Leibniz  notation  is  that  it  often  turns  out  that  what  looks 
like  ordinary  arithmetic  gives  the  correct  answer,  even  if  something  more  complicated  is 
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going  on.  For  example,  in  Leibniz  notation  the  chain  rule  is 

dy  dy  dt 
dx      dt  dx 

The  same  is  true  of  our  current  expression: 

/x"^    /—  du  ^        [  x'^    /—  ^ 


Now  we're  almost  there:  since  u  =  l  —  x'^,x'^  =  l~u  and  the  integral  is 


y  -  ^(1  -  u)y/udu. 


It's  no  coincidence  that  this  is  exactly  the  integral  we  computed  in  (8.1.1),  we  have  simply 
renamed  the  variable  u  to  make  the  calculations  less  confusing.  Just  as  before: 


j -I{l-U)^du={^-U-^^  ^3/2  ^  (J 


Then  since  u  =  1  —  x^ 


j  x^^l^dx  =  Q(l  -  x^)  -  ^)  (1  -  x^/^  +  C. 

To  summarize:  if  we  suspect  that  a  given  function  is  the  derivative  of  another  via  the 
chain  rule,  we  let  u  denote  a  likely  candidate  for  the  inner  function,  then  translate  the 
given  function  so  that  it  is  written  entirely  in  terms  of  u,  with  no  x  remaining  in  the 
expression.  If  we  can  integrate  this  new  function  of  u,  then  the  antiderivative  of  the 
original  function  is  obtained  by  replacing  u  by  the  equivalent  expression  in  x. 

Even  in  simple  cases  you  may  prefer  to  use  this  mechanical  procedure,  since  it  often 
helps  to  avoid  silly  mistakes.  For  example,  consider  again  this  simple  problem: 

j  2x  cos(a;2)  dx. 

Let  u  =  x^,  then  du/dx  =  2x  or  du  =  2xdx.  Since  we  have  exactly  2xdx  in  the  original 
integral,  we  can  replace  it  by  du: 

j  2x  cos(a;2)  dx  =  J  cos  udu  =  sin u  +  C  =  sm{x'^)  +  C. 


This  is  not  the  only  way  to  do  the  algebra,  and  typically  there  are  many  paths  to  the 
correct  answer.  Another  possibility,  for  example,  is:  Since  du/dx  =  2x,  dx  =  du/2x,  and 
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then  the  integral  becomes 


du 


2x  cos(x'^)  dx  —  /  2a;  cos u —  =  /  cosudu. 

The  important  thing  to  remember  is  that  you  must  ehminate  all  instances  of  the  original 
variable  x. 

EXAMPLE  8.1.1  Evaluate  J {ax  +  b)^  dx,  assuming  that  a  and  b  are  constants,  0, 
and  n  is  a  positive  integer.  We  let  u  =  ax  +  b  so  du  =  a  dx  or  dx  =  du/a.  Then 


/  (ax  +  bT  dx=  [  -u^  du  =  —r^ — 
J  ^         '  J  a  a{n+l 


^1^+1  +C=  -r^—^ 

)  a{n+l) 


{ax  +  by+^  +  C. 


□ 


EXAMPLE  8.1.2  Evaluate  j  sin(ax  +  b)  dx,  assuming  that  a  and  b  are  constants  and 
a  7^  0.  Again  we  lei  u  —  ax  +  b  so  du  =  a  dx  or  dx  —  du/a.  Then 

r  [11  1 

/  sin(aa;  +  b)dx  =  /  -  sinudu  =  -(—  cos u)  +  C  =  —  cos(aa;  +  b)  +  C. 
J  J  a  a  a  n 


EXAMPLE  8.1.3  Evaluate  J  a;sin(a;  )  dx.  First  we  compute  the  antiderivative,  then 
evaluate  the  definite  integral.  Let  u  =  x^  so  du  =  2xdx  ot  xdx  =  du/2.  Then 

J  xsin{x^)  dx  —  J  -  sinudu  =  -(— cosw)  +  C  =  —-  cos(a;^)  +  C. 


Now 


i: 


X sm{x'^)  dx  =  — -cos(a;^) 


—  ^  cos(16)  +  ^  cos(4). 


A  somewhat  neater  alternative  to  this  method  is  to  change  the  original  limits  to  match 
the  variable  u.  Since  u  =  x? ,  when  a;  =  2,  tt  =  4,  and  when  a;  =  4,  tt  =  16.  So  we  can  do 
this: 


r 


X  sin(a;^)  dx 


16 


1 


sin  u  du 


1 


[zosu] 


16 


-  cos(16)  +  -  cos(4). 


An  incorrect,  and  dangerous,  alternative  is  something  like  this: 


4  ^4 
2^ 


X  sin  (a;  )  dx 


-  sin udu  —  —  -  cos(u) 


—  cosfa;^) 
2      ^  ' 


—  -  cos(16)  +  -  cos(4). 
z  z 


r  1 

k  2" 


This  is  incorrect  because  /    -  sin  m  du  means  that  u  takes  on  values  between  2  and  4,  which 


is  wrong.  It  is  dangerous,  because  it  is  very  easy  to  get  to  the  point  —  -  cos(«) 


and  forget 
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to  substitute  x"^  back  in  for  u,  thus  getting  the  incorrect  answer  —  -  cos(4)  +  -  cos(2).  A 
somewhat  clumsy,  but  acceptable,  alternative  is  something  like  this: 

1      ,  . 


X  sin(x  )  dx 


x=4 


x=2  2 


smudu=  — -cos(t(.) 


x=4 


COS  X 


x=2 


cos(16)  ^  cos(4) 


□ 


EXAMPLE  8.1.4  Evaluate 


1/2 


cos(7rt) 


dt.  Let  u  =  sin(7rt)  so  (iw  =  tt  cos(7rt)  dt  or 


'i/4  sin^(7rt) 

duji:  =  cos{nt)  dt.  We  change  the  limits  to  sin(7r/4)  =  and  sin(7r/2)  =  1.  Then 


1/2 


cos(7rt) 
1/4  sin^(7rt) 


dt 


V2/2 


1  1 


TV 


du 


1 

v'2/2  TT 


-u  du 


1  u-^ 


TT  -1 


2/2 


1 

--  +   . 

TT  TT 


□ 


Exercises  8.1. 

Find  the  antiderivatives. 


1 
3 
5. 
7. 
9. 


sin"^  a;  cos  x  dx 


dx 


x-^ 


sma; 


dx 


/  x  sec  {x  )  tan(a:  )  dx 
Jo 


11.     /    sin  (3a;)  cos(3x)  dx 

AF/2 


15. 


17. 


19. 


1 


(3x  -  7)2 
6x 

(x2  -  7)1/9 
L 

sin a;  dx  = 


dx  = 
dx 


2. 

4. 

6. 

8. 
10. 
12. 

14. 
16. 
18. 
20. 


'x^  +  1)^  dx 


dt 


^/l^t 


cos(7rt)  cos^sin(7rt)  j  dt 

tan  X  dx  =^> 

sec^  X  tan  x  dx  =^ 
sin(tanx) 


dx 


7r/6 


(cos  X  —  sin  x)  dx 
y  (2x^  -  l)(x^  -  2x)'^dx 
f{x)f{x)dx 
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8.2    Powers  of  sine  and  cosine 

Functions  consisting  of  products  of  the  sine  and  cosine  can  be  integrated  by  using  substi- 
tution and  trigonometric  identities.  These  can  sometimes  be  tedious,  but  the  technique  is 
straightforward.  Some  examples  will  suffice  to  explain  the  approach. 

EXAMPLE  8.2.1    Evaluate  y"  sin^  a;  dx.  Rewrite  the  function: 

sin^  xdx  =  J  sin  x  sin^  xdx  =  J  sin  a;(sin^  x)^  dx  =  J  sin  a;(l  —  cos^  x)'^  dx. 

Now  use  u  =  cos  x,  du  =  —  sin  x  dx: 

J  smx{l  -  cos^  x)'^  dx  =  J -{1  -  u^)"^  du 

=  J -{l-2u^  +u'^)du 

=  -u+  -W^  -  -u^  +  C 
0  5 

2315^ 
=  —  COS  X  +  -  COS  X  —  -  COS  X  +  C. 

3  5 


□ 


EXAMPLE  8.2.2  Evaluate  J  sin^a;cia;.  Use  sin^a;  =  (1  -  cos(2a;))/2  to  rewrite  the 
function: 

/"  •  6    ^            •  2   x3^        f  (l-cos2a;)3  ^ 
/  sm  xdx  —  /  (sm  x)  dx  =  dx 


1  —  3  cos  2x  +  3  cos^  2x  —  cos^  2x  dx 


Now  we  have  four  integrals  to  evaluate: 
and 


3 

-3  cos  2xdx  =  —  sin  2x 
2 
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are  easy.  The  cos'^  2x  integral  is  like  the  previous  example: 

-  cos^  2xdx  =  I  —  cos  2x  cos^  2x  dx 


—  j  —  cos2x{l  —  siv?  2x)  dx 


-]^{l-u')du 


1  /  u 


2  V"~T 


1  /  sin^2a; 
sin  2a; 


2  V  3 

And  finally  we  use  another  trigonometric  identity,  cos^  x  =  (1  +  cos(2x))/2: 

2                /"l  +  cos4x  3  /  sin4x 

3  cos  2x  ax  =  3  /  ax  =  -    x  + 


2  2  V  4 


So  at  long  last  we  get 


fj  X      3  If  sin"^2x\      3  /       sin4x  \ 

sm  X  ax  =  -  -  —  sm 2x  -  —    sm 2x  —  ]  +  t7^  \  x  ^  —  |  +  C. 

8      lb  lb  V  3      /      lb  V  4 


□ 


EXAMPLE  8.2.3  Evaluate  j sm^  xcos^  xdx.  Use  the  formulas  sin^  x  =  (l-cos(2x))/2 
and  cos^  x  =  (1  +  cos(2x))/2  to  get: 


sin^  X  cos'^  xdx  =  I   ^ — -  ■  — '  ^ — -  dx. 

J  2  2 

The  remainder  is  left  as  an  exercise.  □ 

Exercises  8.2. 

Find  the  antiderivatives. 
1. 


1  —  cos(2x)    1  +  cos(2x) 


3.     /  sin'*a;dx=>  4.     /  <zo^  x^\\^  xdx 


5.     /  cos^a;dx=>  6.    y  sin^  a;  cos^  x  dx 

7.     /  cos^  a;  sin^  X      =^>  8.     /  sin  x(cos  x)^^^  dx 


9.     /  sec^  X  csc^  X  dx  10.     /  tan^xsecxdx 


8.3    Trigonometric  Substitutions  173 


8.3    Trigonometric  Substitutions 

So  far  we  have  seen  that  it  sometimes  helps  to  replace  a  subexpression  of  a  function  by 
a  single  variable.  Occasionally  it  can  help  to  replace  the  original  variable  by  something 
more  complicated.  This  seems  like  a  "reverse"  substitution,  but  it  is  really  no  different  in 
principle  than  ordinary  substitution. 

EXAMPLE  8.3.1    Evaluate  J  \J\  —     dx.  Let  a;  =  sintt  so  dx  =  cosudu.  Then 

j  y/l  —  x"^  dx  =  J  V 1  —  sin^  u cos udu  =  J  Vcos^  u cosudu. 

We  would  like  to  replace  V cos^  u  by  cosu,  but  this  is  valid  only  if  costt  is  positive,  since 
V cos^  u  is  positive.  Consider  again  the  substitution  x  =  sin  u.  We  could  just  as  well  think 
of  this  as  u  =  arcsinx.  If  we  do,  then  by  the  definition  of  the  arcsine,  — 7r/2  <u<  7r/2,  so 
costt  >  0.  Then  we  continue: 

f  I  5—         ,        /"      2     ,        /■  1  +  cos  2^  ,       u     sin  2m  ^ 

/  V  cos^  u  cos  udu  =  /  cos  udu  =  /  du  =  —  -\  h  C 

J  J  7224 

arcsina;     sin(2  arcsinx)  ^ 

=  1  ^   +  C. 

2  4 

This  is  a  perfectly  good  answer,  though  the  term  sin(2  arcsins)  is  a  bit  unpleasant.  It  is 
possible  to  simplify  this.  Using  the  identity  sin  2a:  =  2sin,Tcos,T,  wc  can  write  sin2'U  = 

2 sin M cos w  =  2 sin(arcsin x)^  1  —  sin^ u  =  2x\Jl  —  sin^(arcsina;)  =  2x\/l  —  x"^.  Then  the 
full  antiderivative  is 

arcsina;     2x^/1  —  x"^      arcsina;     x^/l  —  x^  ^ 

 \  =  \  V  C. 

2  4  2  2 

□ 

This  type  of  substitution  is  usually  indicated  when  the  function  you  wish  to  integrate 
contains  a  polynomial  expression  that  might  allow  you  to  use  the  fundamental  identity 
sin^  X  +  cos^  a;  =  1  in  one  of  three  forms: 

cos^  a;  =  1  —  sin^  x       sec^  a;  =  1  +  tan^  x       tan^  x  =  sec^  a;  —  1. 

If  your  function  contains  1  —  x^,  as  in  the  example  above,  try  x  =  sin-u;  if  it  contains  1  +  a;^ 
try  X  =  tanw;  and  if  it  contains  —  1,  try  x  =  seen.  Sometimes  you  will  need  to  try 
something  a  bit  different  to  handle  constants  other  than  one. 
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EXAMPLE  8.3.2  Evaluate  j  4  -  9x'^  dx.  We  start  by  rewriting  this  so  that  it  looks 
more  like  the  previous  example: 

j  ^4  -  9a;2  dx  =  J  ^/^(r^^O^^TsF)  f^^;  =  J  2^1  -  (3x/2)2dx. 

Now  let  3x/2  =  sinw  so  (3/2)  (ix  =  cosudu  or  dx  =  (2/3)  cosudu.  Then 

j  2^1  -  (3x/2)2dx  =  y  2\/l  -  sin^  w(2/3)  cosudu=^j  cos^  udu 

4u     4  sin  2u  ^ 

=  y  +  — +^ 

2  arcsin(3a;/2)     2  sin  m  cos  m  ^ 

=  ^ — —  H  h  C 

3  3 

2  arcsin(3x/2)  ^  2  sin(arcsin(3x/2))  cos(arcsin(3x/2)) 


_  2 arcsin(3x/2)     2{?,x/2)^l  -  (3x/2)2 

-  3  +  3  +^ 

2  arcsin(3x/2)     xV4  —  9x2 

=  ^ — ^  H  h  C, 

3  2 

using  some  of  the  work  from  example  8.3.1.  □ 
EXAMPLE  8.3.3    Evaluate  j  a/i  +  x2  dx.  Let  x  =  tanw,  dx  =  sec^  udu,  so 

Since  m  =  arctan(x),  — 7r/2  <  u  <  ti/2  and  secw  >  0,  so  V sec2  u  =  secw.  Then 

V sec2  u  sec^  udu  =  /  sec'^  w  dw. 


M  problem,  of  thi.  type,  two  integral,  come  up  frequently:  /.ec'u*  and  f^cudu. 
Both  have  relatively  nice  expressions  but  they  are  a  bit  tricky  to  discover. 
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First  we  do  J  secudu,  which  we  will  need  to  compute  J  sec^  udu: 


,         sec  u  +  tan  u 

sec  udu  —  I  sec  u  du 

sec  u  +  tan  u 


-I 


sec^  u  +  sec  u  tan  u 
sec  u  +  tan  u 


du. 


Now  let  w  =  secu  +  tantt,  dw  =  sec  tt  tan  it  +  sec^  udu,  exactly  the  numerator  of  the 
function  we  are  integrating.  Thus 

/f  sec^  u  +  sec  u  tan  u  /"  1   ,       i   i    i  ^ 

sec  udu  =  /   du  =      —  dw  =  In  ku  +  C 
J       sec  u  +  tan  w  J  w 

=  In  I  sec  u  +  tan  w|  +  C. 

Now  for  /  sec^  udu: 


o  sec^  u  sec^  u  sec^  u  (tan  u+1)  sec  m 
sec^  u  =  —  \  —  =  —  h  ^  — ^  


2  2  2 

sec  u  tan^  w 

+  ^  +         =   ^  + 


sec^  u     sec  u  tan^  u     sec  u     sec^  tt  +  sec  u  tan^  w     sec  u 


We  already  know  how  to  integrate  sec  u,  so  we  just  need  the  first  quotient.  This  is  "simply" 
a  matter  of  recognizing  the  product  rule  in  action: 

sec^  u  +  sec u tan^  udu  =  sec u tan u. 


So  putting  these  together  we  get 


o            secittantt     In  I  sectt  +  tanttl 
sec  udu  =  1  h  6 , 


and  reverting  to  the  original  variable  x: 

/r           ,       sec  « tan M     In  I  sec«  +  tanul  „ 
VI  ^x^dx=   ^  +   ^  ^  +  C 

sec(arctan  a;)  tan(arctan  x)     In  |  sec(arctan  x)  +  tan(arctan  a;)  | 
=  2  +  2  +^ 

+         In  \\J\  +     +  a;| 
=        2       +  2 

using  tan(arctana;)  =  x  and  sec(arctana;)  =  -^1  +  tan^ (arctan a;)  =  -x/l  +  n 
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Exercises  8.3. 

Find  the  antiderivatives. 
1.       cscxdx  2.     /  CSC  xdx 


3.    j  \Jx^  -Idx  =^  4.  J  \/9  +  4a;2  dx  = 

7.     /    ,   ^       dx  ^  8.     /  \/a;2  +  2a;  dx  = 

J  ^/Y^  J 

9.     /  -^r-^ — ^  dx  10.     /    ,  ^  dx 


.3 


11.     I  ^^dx^  12.     /  . 


a/1  -  a;  y   a/4x2  -  1 

8.4    Integration  by  Parts 

We  have  already  seen  that  recognizing  the  product  rule  can  be  useful,  when  we  noticed 
that 

sec"^  u  +  sec  u  tan^  udu  =  sec  u  tan  u. 


As  with  substitution,  we  do  not  have  to  rely  on  insight  or  cleverness  to  discover  such 
antiderivatives;  there  is  a  technique  that  will  often  help  to  uncover  the  product  rule. 
Start  with  the  product  rule: 

d 


We  can  rewrite  this  as 


=  f'{x)g{x)  +  f{x)g'{x). 
f{x)g{x)  =  j  f'{x)g{x)dx  +  J  f{x)g'{x)dx, 

and  then 

j  f{x)g'{x)dx  =  f{x)g{x)  -  j  f'{x)g{x)dx. 

This  may  not  seem  particularly  useful  at  first  glance,  but  it  turns  out  that  in  many  cases 
we  have  an  integral  of  the  form 

f{x)g'{x)dx 


but  that 

f'{x)g{x)dx 


is  easier.  This  technique  for  turning  one  integral  into  another  is  called  integration  by 
parts,  and  is  usually  written  in  more  compact  form.  If  we  let  u  =  f{x)  and  v  =  g{x)  then 
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du  =  f'{x)  dx  and  dv  =  g'{x)  dx  and 

j  udv  =  uv  —  j  V  du. 

To  use  this  technique  we  need  to  identify  hkely  candidates  for  u  =  f{x)  and  dv  =  g'{x)  dx. 

EXAMPLE  8.4.1  Evaluate  J  xlnxdx.  Let  u  =  Inx  so  du  =  1/xdx.  Then  we  must 
let  dv  =  xdx  so  V  =     jl  and 


x^\\\.x      f  x^  1  x^liix      fx  x^lnx  x"^ 

xdx  =  —  /  dx  =  —  /  —  ax  =   +  C. 

2  X  2         /  2  2  4 


□ 


EXAMPLE  8.4.2  Evaluate  J  xsinxdx.  Let  u  =  x  so  du  =  dx.  Then  we  must  let 
dv  =  sin  xdx  so  V  =  —  cos  x  and 

J  X  sin  xdx  =  —x  cos  x  —  J  —  cos  xdx  =  —x  cos  ^    J      xdx  =  —x  cos  x  +  sin  x  +  C. 

EXAMPLE  8.4.3    Evaluate  J  sec^  x  dx.  Of  course  we  already  know  the  answer  to  this, 

but  we  needed  to  be  clever  to  discover  it.  Here  we'll  use  the  new  technique  to  discover  the 
antiderivative.  Let  u  =  sec  a;  and  dv  =  sec^  xdx.  Then  du  =  sec  a;  tan  a;  and  v  =  tana;  and 

J  sec^  xdx  =  sec x  tan ^  ~  J  ^^^^  ^ ^ 

=  sec  X  tan  x  —  J  (sec^  a;  —  1)  sec  x  dx 
=  sec X tan x  —     sec^ xdx  +  /  sec x dx. 

J  sec^  xdx  =  sec  x  tan  ^  ~  J       xdx  +  J  sec  x  dx 
J  sec^  xdx  +  J  sec^  xdx  =  sec  x  tan  ^     J  ^ 

2  /  sec"^  xdx  =  sec  x  tan  x  +     sec  x  dx 


n            sec  a;  tan  a;     1  f 
sec  xdx  =  2  / 

sec  x  tan  a;     In  I  sec  a;  +  tana;!  ^ 

=  — ^ —  +  —  ^          +  c. 


□ 
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EXAMPLE  8.4.4    Evaluate  J  x^smxdx.  Let  u  =  x^,  dv  =  sinxdx;  then  du  =  2xdx 


and  V  =  —cosx.  Now  /  x^sinxdx  =  —x  cosa;+  /  2a;  cos  a;  dx.  This  is  better  than  the 


J  x^  sin  xdx  =  —x^  cos  x  +  J '. 

original  integral,  but  we  need  to  do  integration  by  parts  again.  Let  u  =  2x,  dv  =  cos  xdx; 
then  du  =  2  and  v  =  sin  a;,  and 


/ 


X  sin  xdx  =  —x  cos  x  +     2x  cos  x  dx 


=  —x^  COS  a;  +  2a;  sin  x  —  I  2  sin  x  dx 
=  — COS  a;  +  2a;  sin  a;  +  2  cos  x  +  C. 


□ 


Such  repeated  use  of  integration  by  parts  is  fairly  common,  but  it  can  be  a  bit  tedious  to 
accomplish,  and  it  is  easy  to  make  errors,  especially  sign  errors  involving  the  subtraction  in 
the  formula.  There  is  a  nice  tabular  method  to  accomplish  the  calculation  that  minimizes 
the  chance  for  error  and  speeds  up  the  whole  process.  We  illustrate  with  the  previous 
example.  Here  is  the  table: 


sign 

u 

dv 

u 

dv 

x^ 

sin  a; 

x^ 

sin  a; 

2x 

—  cos  a; 

or 

-2x 

—  cos  a; 

2 

—  sin  a; 

2 

—  sin  a; 

0 

cos  a; 

0 

cosx 

To  form  the  first  table,  we  start  with  u  at  the  top  of  the  second  column  and  repeatedly 
compute  the  derivative;  starting  with  dv  at  the  top  of  the  third  column,  we  repeatedly 
compute  the  antiderivative.  In  the  first  column,  we  place  a  "— "  in  every  second  row.  To 
form  the  second  table  we  combine  the  first  and  second  columns  by  ignoring  the  boundary; 
if  you  do  this  by  hand,  you  may  simply  start  with  two  columns  and  add  a  "— "  to  every 
second  row. 

To  compute  with  this  second  table  we  begin  at  the  top.  Multiply  the  first  entry  in 
column  u  by  the  second  entry  in  column  dv  to  get  — a;^ cosx,  and  add  this  to  the  integral 
of  the  product  of  the  second  entry  in  column  u  and  second  entry  in  column  dv.  This  gives: 


/ 


X  COSX+  /  2x  COS  xdx. 


or  exactly  the  result  of  the  first  application  of  integration  by  parts.  Since  this  integral  is 
not  yet  easy,  we  return  to  the  table.  Now  we  multiply  twice  on  the  diagonal,  (x^)(—  cosx) 
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and  (— 2x)(— sinx)  and  then  once  straight  across,  (2)(— sinx),  and  combine  these  as 

— x^  cosx  +  2xsinx  —  /  2sinx(ix, 


giving  the  same  result  as  the  second  apphcation  of  integration  by  parts.  While  this  integral 
is  easy,  we  may  return  yet  once  more  to  the  table.  Now  multiply  three  times  on  the  diagonal 
to  get  (x^)(— cosx),  (— 2x)(— sinx),  and  (2)(cosx),  and  once  straight  across,  (O)(cosx). 
We  combine  these  as  before  to  get 

— x^  cos  X  +  2x  sin  x  +  2  cos  x  +  /  Odx  =  —x'^  cos  x  +  2x  sin  x  +  2  cos  x  +  C. 


Typically  we  would  fill  in  the  table  one  line  at  a  time,  until  the  "straight  across"  multipli- 
cation gives  an  easy  integral.  If  we  can  see  that  the  u  column  will  eventually  become  zero, 
we  can  instead  fill  in  the  whole  table;  computing  the  products  as  indicated  will  then  give 
the  entire  integral,  including  the  "-I-C  " ,  as  above. 

Exercises  8.4- 

Find  the  antiderivatives. 
1.   J  xcosxdx^  2.    J  x^cosxdx^ 

3.     /  xe^  4.     /  xe^  dx 


5.    j  sin'xdx  =^  6.    J  Inx  dx  ^ 

7.     /  x  arctan  a;  da;  =^  8.     /  x^sinxdx 


9.     /  x^cosxdx^  10.     /  xsin^xdx 


11.    J  X  sin  X  cos  a;      =^  12.    J  arctan(-yx)  da;  =^ 

13.    J  sin(v^)  dx  =^  14.   J  sec^  x  csc^  x  dx 

8.5    Rational  Functions 

A  rational  function  is  a  fraction  with  polynomials  in  the  numerator  and  denominator. 
For  example, 

x"^  1  x^  + 1 

x^-f-x  — 6'  (x  — 3)2'  x^  —  1 ' 

are  all  rational  functions  of  x.  There  is  a  general  technique  called  "partial  fractions" 
that,  in  principle,  allows  us  to  integrate  any  rational  function.  The  algebraic  steps  in  the 
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technique  are  rather  cumbersome  if  the  polynomial  in  the  denominator  has  degree  more 
than  2,  and  the  technique  requires  that  we  factor  the  denominator,  something  that  is  not 
always  possible.  However,  in  practice  one  does  not  often  run  across  rational  functions  with 
high  degree  polynomials  in  the  denominator  for  which  one  has  to  find  the  antidcrivative 
function.  So  we  shall  explain  how  to  find  the  antiderivative  of  a  rational  function  only 
when  the  denominator  is  a  quadratic  polynomial  ax^  +  6x  +  c. 

We  should  mention  a  special  type  of  rational  function  that  we  already  know  how  to 
integrate:  If  the  denominator  has  the  form  {ax  +  6)",  the  substitution  u  =  ax  +  h  will 
always  work.  The  denominator  becomes  ,  and  each  x  in  the  numerator  is  replaced  by 
{u  —  b)/a,  and  dx  —  du/a.  While  it  may  be  tedious  to  complete  the  integration  if  the 
numerator  has  high  degree,  it  is  merely  a  matter  of  algebra. 

EXAMPLE  8.5.1    Find  /  j- — — — ^  dx.  Using  the  substitution  u  =  3  -  2a;  we  get 
x^       ^        If  (^)    ,        1    /■  ^3  _  9^2  ^  21u  -  27  , 

dx  =    /    ^  ^ —  dU  =  —    I   dU 

16  7 


{2,-2xf  -2  J  16  J 

1 


16 


-  9w-3  +  27w-^  -  27^-^  du 


1  fu-^     9u-2     27u-3     21u-^\  ^ 

+  — ^  ^  +c 


16  V  -1       -2         -3  -4 

1  /(3-2a;)-^     9(3  -  2a;)-2  ^  27(3  -  2a;)-3     27(3  -  2a;)" 


16  V       -1  -2  -3  -4 

1  9  9  27  ^ 


16(3 -2a;)     32(3  -  2a;)2      16(3  -  2a;)3     64(3  -  2x)^  ^ 

We  now  proceed  to  the  case  in  which  the  denominator  is  a  quadratic  polynomial.  We 
can  always  factor  out  the  coefficient  of  x^  and  put  it  outside  the  integral,  so  we  can  assume 
that  the  denominator  has  the  form  x"^  -\-hx-\-  c.  There  are  three  possible  cases,  depending 
on  how  the  quadratic  factors:  either  x^  +  bx  +  c  =  (x  —  r)(x  —  s),  x"^  +  bx  +  c  =  (a;  —  r)^, 
or  it  doesn't  factor.  We  can  use  the  quadratic  formula  to  decide  which  of  these  we  have, 
and  to  factor  the  quadratic  if  it  is  possible. 

EXAMPLE  8.5.2  Determine  whether  x'^  +  x  +  1  factors,  and  factor  it  if  possible.  The 
quadratic  formula  tells  us  that  a;^  +  a;  +  1  =  0  when 

-i±  v^r^ 

^=  2  • 

Since  there  is  no  square  root  of  —3,  this  quadratic  does  not  factor.  □ 
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EXAMPLE  8.5.3  Determine  whether  x'^  —  x  —  1  factors,  and  factor  it  if  possible.  The 
quadratic  formula  tells  us  that      —  x  —  1  =  0  when 

1  ±  v/rT4    1  ± 

X  =   =  — - — . 


Therefore 


x^  —  X  —  1  —  \  X 


□ 

If  x'^  +  bx  +  c  =  (x  —  r)^  then  we  have  the  special  case  we  have  already  seen,  that  can 
be  handled  with  a  substitution.  The  other  two  cases  require  different  approaches. 
If  x^  +  6x  +  c  =  (x  —  r)(x  —  s),  we  have  an  integral  of  the  form 

dx 


(x  —  r)(x 

where  p{x)  is  a  polynomial.  The  first  step  is  to  make  sure  that  p{x)  has  degree  less  than 
2. 

 —   dx  in  terms  of  an  integral  with  a  numerator 

(x-2)(x  +  3) 

that  has  degree  less  than  2.  To  do  this  we  use  long  division  of  polynomials  to  discover  that 
x"^  x^  7x  —  6  7x  —  6 

X  —  1  H  t;   =  X  —  1  + 


(x-2)(x  +  3)      x2  +  x-6  x2+x-6  (x-2)(x  +  3)' 

so 

J  f       1  J        f       7x  -  6 

dx  =  /  X  —  lax+  /  -.  -—  —  dx. 


x-2)(x  +  3)         J  J  (x-2)(x  +  3) 

The  first  integral  is  easy,  so  only  the  second  requires  some  work.  □ 
Now  consider  the  following  simple  algebra  of  fractions: 

A    ^    B    _  A{x  -  s)  +  B{x  -  r)  _  {A  + B)x  -  As  -  Br 


x  —  r      x  —  s  (x  — r)(x  — s)  (x  — r)(x  — s) 

That  is,  adding  two  fractions  with  constant  numerator  and  denominators  (x  —  r)  and  (x  —  s) 
produces  a  fraction  with  denominator  (x  —  r)  (x  —  s)  and  a  polynomial  of  degree  less  than 
2  for  the  numerator.  We  want  to  reverse  this  process:  starting  with  a  single  fraction,  we 
want  to  write  it  as  a  sum  of  two  simpler  fractions.  An  example  should  make  it  clear  how 
to  proceed. 
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/x^  Tx  —  6 

7  —.  r  dx.  We  start  by  writing  ^7  r 

[x  -2){x  +  3)  [x  -2){x  +  3) 

as  the  sum  of  two  fractions.  We  want  to  end  up  with 

7x-6             A  B 
+ 


(a;-2)(a;  +  3)      x-2     x  +  3' 

If  we  go  ahead  and  add  the  fractions  on  the  right  hand  side  we  get 

7x-6       _  {A  +  B)x  +  3A-  2B 
{x-2){x  +  3)  ~       {x  -  2){x  +  3)  ■ 

So  all  we  need  to  do  is  find  A  and  B  so  that  7x  —  6  =  {A  +  B)x  +  3 A  —  2B,  which  is  to 
say,  we  need  7  =  A  +  B  and  —6  =  3A  —  2B.  This  is  a  problem  you've  seen  before:  solve  a 
system  of  two  equations  in  two  unknowns.  There  are  many  ways  to  proceed;  here's  one:  If 
7  =  A  +  B  then  B  =  7- A  and  so  -6  =  3A-2B  =  3A-2{7-A)  =  3A-14  +  2A  =  5A-14. 
This  is  easy  to  solve  for  A:  A  =  8/5,  and  then  5  =  7-  ^  =  7-  8/5  =  27/5.  Thus 

7x-6        ,        [  8    1        27    1      ,       8,   ,        ,     27,   ,        ,  ^ 

ax  =   +  —   ax  —  -  m  a;  —  2  +  —  m  a;  +  3  +  C. 


(a;-2)(a;  +  3)         J  5x-2      5x  +  3  5'        '  5 

The  answer  to  the  original  problem  is  now 

x^  f  f       7x  —  6 

dx  —     X  —  Idx  +  /  —  —  dx 


(a;-2)(a;  +  3)         J  J  (a;-2)(a;  +  3) 

8,   ,       ^,     27,   ,  ^ 
=  —  -  a;  +  -  In  |x  -  2|  +  —  In  |x  +  3|  +  C. 
2  5  5 

□ 

Now  suppose  that  x'^  +  bx  +  c  doesn't  factor.  Again  we  can  use  long  division  to  ensure 
that  the  numerator  has  degree  less  than  2,  then  we  complete  the  square. 

/x+1 
-T,  :          dx.  The  quadratic  denominator  does  not 
a;2  +  4a;  +  8  ^ 

factor.  We  could  complete  the  square  and  use  a  trigonometric  substitution,  but  it  is  simpler 
to  rearrange  the  integrand: 

J  x"^ +  4:X  +  8^^  ~  J  x"^ +4x  +  8^^     J  x'^  +  4x  +  8^^' 
The  first  integral  is  an  easy  substitution  problem,  using  tt  =  a;^  +  4a;  +  8: 


/ 


X  +  2       ,       1   [  du      1,1  2 


a;2  +  4a;  +  8  2 


dx=  -  I  —  =  -  In  la;^  +  4a;  +  81. 


u 
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For  the  second  integral  we  complete  the  square: 


+  4a;  +  8  =  (a;  +  2)2  +  4  =  4 


x  +  2 


+  1 


making  the  integral 


x  +  2 

Using  u  =  — - —  we  get 


4y  i^-^Y  +  i 


dx. 


4J   {^^Y  +  l 
The  final  answer  is  now 


dx 


2      ,       1  fx  +  2 

 dx  =  -  arctan   

+  1  2  \  2 


 ax  =  -  in  X  +  4x  +  8  arctan    +  G. 

x2  +  4x  +  8  2'  '2  V2' 


□ 


Exercises  8.5. 

Find  the  antiderivatives. 
1 


1 
3 
5. 
7. 
9. 


4  -  a;2 


dx 


.     f  1 

J  + 


X 


4  +  a;2 

,3 


x  +  25 


dx 


dx 


X 


4  +  a;2 
1 


dx 


2x^  —  X  —  3 


dx 


2. 
4. 
6. 
8 
10. 


X 


4  -  a;2 

.2 


X 


4  - 


dx 


dx 


1 


a;2  +  10a;  +  29 


dx 


.     [  1 

J  2;2  +  10 


a;-^  +  3a; 


10a;  +  21 
da; 


dx 
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We  have  now  seen  some  of  the  most  generally  useful  methods  for  discovering  antiderivatives, 
and  there  are  others.  Unfortunately,  some  functions  have  no  simple  antiderivatives;  in  such 
cases  if  the  value  of  a  definite  integral  is  needed  it  will  have  to  be  approximated.  We  will 
see  two  methods  that  work  reasonably  well  and  yet  are  fairly  simple;  in  some  cases  more 
sophisticated  techniques  will  be  needed. 
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Of  course,  we  already  know  one  way  to  approximate  an  integral:  if  we  think  of  the 
integral  as  computing  an  area,  we  can  add  up  the  areas  of  some  rectangles.  While  this 
is  quite  simple,  it  is  usually  the  case  that  a  large  number  of  rectangles  is  needed  to  get 
acceptable  accuracy.  A  similar  approach  is  much  better:  we  approximate  the  area  under  a 
curve  over  a  small  interval  as  the  area  of  a  trapezoid.  In  figure  8.6.1  we  see  an  area  under 
a  curve  approximated  by  rectangles  and  by  trapezoids;  it  is  apparent  that  the  trapezoids 
give  a  substantially  better  approximation  on  each  subinterval. 


Figure  8.6.1     Approximating  an  area  with  rectangles  and  with  trapezoids. 


As  with  rectangles,  we  divide  the  interval  into  n  equal  subintervals  of  length  Ax.  A 

ical  trapezoid  is  pictured  in  fi^ 
the  areas  of  all  trapezoids  we  get 


typical  trapezoid  is  pictured  in  figure  8.6.2;  it  has  area  ■^('^^-^    ^^''^^^^^  /\x.  If  we  add  up 


f{xo)  +  f{x 


i^Ax  + 


/(xi)  +  /(^^^  +  ■  ■  ■  +  /(^n-l)  +  ^ 


2  2 

+  /(Xl)  +  /(X2)  +  •  •  •  +  /(X.-I)  +  ^ 


Ax. 


This  is  usually  known  as  the  Trapezoid  Rule.  For  a  modest  number  of  subintervals  this 
is  not  too  difficult  to  do  with  a  calculator;  a  computer  can  easily  do  many  subintervals. 


{Xi,f{Xi)) 


Figure  8.6.2     A  single  trapezoid. 
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In  practice,  an  approximation  is  useful  only  if  we  know  how  accurate  it  is;  for  example, 
we  might  need  a  particular  value  accurate  to  three  decimal  places.  When  we  compute  a 
particular  approximation  to  an  integral,  the  error  is  the  difference  between  the  approxi- 
mation and  the  true  value  of  the  integral.  For  any  approximation  technique,  we  need  an 
error  estimate,  a  value  that  is  guaranteed  to  be  larger  than  the  actual  error.  If  A  is  an 
approximation  and  E  is  the  associated  error  estimate,  then  we  know  that  the  true  value 
of  the  integral  is  between  A  —  E  and  A  +  E.  In  the  case  of  our  approximation  of  the 
integral,  we  want  E  —  E{/S.x)  to  be  a  function  of  Ax  that  gets  small  rapidly  as  Ax  gets 
small.  Fortunately,  for  many  functions,  there  is  such  an  error  estimate  associated  with  the 
trapezoid  approximation. 

THEOREM  8.6.1  Suppose  /  has  a  second  derivative  /"  everywhere  on  the  interval 
[a,  6],  and  <  M  for  all  x  in  the  interval.  With  Ax  =  {b  —  a)/n,  an  error  estimate 

for  the  trapezoid  approximation  is 


Let's  see  how  we  can  use  this. 

EXAMPLE  8.6.2    Approximate  /    e  ^  dx  to  two  decimal  places.  The  second  deriva- 

Jo 

tive  of  /  =  e"""  is  (4x^  - 2)e~^  ,  and  it  is  not  hard  to  see  that  on  [0, 1] ,  |  (4x^  - 2)e~^  |  <  2. 
We  begin  by  estimating  the  number  of  subintervals  we  are  likely  to  need.  To  get  two  dec- 
imal places  of  accuracy,  we  will  certainly  need  E{Ax)  <  0.005  or 

1(2)-!  <  0.005 
^(200) 


5.77  w  \  ——  <  n 


With  ?i  =  6,  the  error  estimate  is  thus  1/6"^  <  0.0047.  We  compute  the  trapezoid  approxi- 
mation for  six  intervals: 

+  /(1/6)  +  /(2/6)  +  •  •  •  +  /(5/6)  +  ^)  I  ^  0.74512. 

So  the  true  value  of  the  integral  is  between  0.74512  -  0.0047  =  0.74042  and  0.74512  + 
0.0047  =  0.74982.  Unfortunately,  the  first  rounds  to  0.74  and  the  second  rounds  to  0.75, 
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SO  we  can't  be  sure  of  the  correct  value  in  the  second  decimal  place;  we  need  to  pick  a  larger 
n.  As  it  turns  out,  we  need  to  go  to  n  =  12  to  get  two  bounds  that  both  round  to  the  same 
value,  which  turns  out  to  be  0.75.  For  comparison,  using  12  rectangles  to  approximate 
the  area  gives  0.7727,  which  is  considerably  less  accurate  than  the  approximation  using  six 
trapezoids. 

In  practice  it  generally  pays  to  start  by  requiring  better  than  the  maximum  possible 
error;  for  example,  we  might  have  initially  required  E{Ax)  <  0.001,  or 

1(2)-L<0.001 

-(1000)  < 
6 


12.91  ^W^<n 

Had  we  immediately  tried  n  —  13  this  would  have  given  us  the  desired  answer.  □ 

The  trapezoid  approximation  works  well,  especially  compared  to  rectangles,  because 
the  tops  of  the  trapezoids  form  a  reasonably  good  approximation  to  the  curve  when  Ax  is 
fairly  small.  We  can  extend  this  idea:  what  if  we  try  to  approximate  the  curve  more  closely, 
by  using  something  other  than  a  straight  line?  The  obvious  candidate  is  a  parabola:  if  we 
can  approximate  a  short  piece  of  the  curve  with  a  parabola  with  equation  y  =  ax^  +  6x  +  c, 
we  can  easily  compute  the  area  under  the  parabola. 

There  are  an  infinite  number  of  parabolas  through  any  two  given  points,  but  only 
one  through  three  given  points.  If  we  find  a  parabola  through  three  consecutive  points 
(xi,/(xi)),  (xi_|_i, /(xi+i)),  (xi+25 /(a;i+2))  on  the  curve,  it  should  be  quite  close  to  the 
curve  over  the  whole  interval  [xi,Xi-(-2],  as  in  figure  8.6.3.  If  we  divide  the  interval  [a,  6] 
into  an  even  number  of  subintervals,  we  can  then  approximate  the  curve  by  a  sequence  of 
parabolas,  each  covering  two  of  the  subintervals.  For  this  to  be  practical,  we  would  like  a 
simple  formula  for  the  area  under  one  parabola,  namely,  the  parabola  through  (x^,  /(x^)), 
(xi+i,  /(xi-i-i)),  and  (xi-|-2,  /(a^j+2))-  That  is,  we  should  attempt  to  write  down  the  parabola 
y  =  ax^  +  6x  +  c  through  these  points  and  then  integrate  it,  and  hope  that  the  result  is 
fairly  simple.  Although  the  algebra  involved  is  messy,  this  turns  out  to  be  possible.  The 
algebra  is  well  within  the  capability  of  a  good  computer  algebra  system  like  Sage,  so  we 
will  present  the  result  without  all  of  the  algebra;  you  can  see  how  to  do  it  in  this  Sage 
worksheet. 

To  find  the  parabola,  we  solve  these  three  equations  for  a,  6,  and  c: 

f{xi)  =  a(xi+i  -  Ax)^  +  b{xi+i  -  Ax)  +  c 
f{xi+i)  =  a(xi+i)^  +  b{xi+i)  +  c 
/(xi+2)  =  a{xi+i  +  Ax)^  +  b{xi+i  +  Ax)  +  c 
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Not  surprisingly,  the  solutions  turn  out  to  be  quite  messy.  Nevertheless,  Sage  can  easily 
compute  and  simplify  the  integral  to  get 

/  ax^  +  bx  +  cdx=  —{f{xi)  +  4:f{xi+i)  +  f{xi+2)). 

Now  the  sum  of  the  areas  under  all  parabolas  is 

^ifixo)  +  4fixi)+fix2)  +  fix2)  +  mX3)  +  fM  +  ---  +  fiXn-2)  +  4fiXn-l)  +  fiXn))  = 

Ax 

—  ifixo)  +  4/(a;i)  +  2f{x2)  +  ^/(xs)  +  2f{x^)  +  ■■■  +  2f{Xn-2)  +  ^/(Xn-l)  +  f{Xn)). 

This  is  just  slightly  more  complicated  than  the  formula  for  trapezoids;  we  need  to  remem- 
ber the  alternating  2  and  4  coefficients.  This  approximation  technique  is  referred  to  as 
Simpson's  Rule. 


{Xi,f{Xi)) 


{Xi+2j{Xi+2)) 


Xi  Xi+1  Xi+2 

Figure  8.6.3     A  parabola  (dashed)  approximating  a  curve  (solid). 

As  with  the  trapezoid  method,  this  is  useful  only  with  an  error  estimate: 

THEOREM  8.6.3  Suppose  /  has  a  fourth  derivative  /'■^•'  everywhere  on  the  interval 
[a,  6],  and  |/'^'^^(a;)|  <  M  for  all  x  in  the  interval.  With  Ax  =  {b  —  a)/n,  an  error  estimate 
for  Simpson's  approximation  is 

A   N      b  —  a^^,^   .A      {b  —  a)^  ^  ^ 


EXAMPLE  8.6.4    Let  us  again  approximate  /   e  ^  dx  to  two  decimal  places.  The 

Jo 

fourth  derivative  of  /  =  is  (16x^  —  48x^  +  12)e~^  ;  on  [0, 1]  this  is  at  most  12  in 

absolute  value.  We  begin  by  estimating  the  number  of  subintervals  we  are  likely  to  need. 
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To  get  two  decimal  places  of  accuracy,  we  will  certainly  need  E{Ax)  <  0.005,  but  taking 
a  cue  from  our  earlier  example,  let's  require  E{Ax)  <  0.001: 

^   12)^  <  0.001 


180 '    '  n4 
200 


2.86  K  <  „ 

So  we  try  n  =  4,  since  we  need  an  even  number  of  subintervals.  Then  the  error  estimate 
is  12/180/4*  <  0.0003  and  the  approximation  is 

(/(O)  +  4/(1/4)  +  2/(1/2)  +  4/(3/4)  +  /(I)) ^  ^  0.746855. 

So  the  true  value  of  the  integral  is  between  0.746855  -  0.0003  =  0.746555  and  0.746855  + 
0.0003  =  0.7471555,  both  of  which  round  to  0.75.  □ 

Exercises  8.6. 

In  the  following  problems,  compute  the  trapezoid  and  Simpson  approximations  using  4  subin- 
tervals, and  compute  the  error  estimate  for  each.  (Finding  the  maximum  values  of  the  second 
and  fourth  derivatives  can  be  challenging  for  some  of  these;  you  may  use  a  graphing  calculator 
or  computer  software  to  estimate  the  maximum  values.)  If  you  have  access  to  Sage  or  similar 
software,  approximate  each  integral  to  two  decimal  places.  You  can  use  this  to  get  started. 
^■3  /-a 
1.     I    X  dx  ^  2.         x^  dx 


5. 


9. 


J  xdx  ^  ^-  J 

f    3  /"I 

X  dx  ^  4.  -  dx  ^ 

J2  Ji  X 

I  — ^ — -  dx  ^  6.     /  x\/l  +  X  dx  = 

Ji   1  +  a;2  /o 

/      ^    dx  ^  8.     /  ^/x^  +  ldx  = 

Ji   1  +  Jo 

J  \/x^  +  ldx  ^  10.  ^\  +  Xjx  dx 


11.  Using  Simpson's  rule  on  a  parabola  /(x),  even  with  just  two  subintervals,  gives  the  exact  value 
of  the  integral,  because  the  parabolas  used  to  approximate  /  will  be  /  itself.  Remarkably, 
Simpson's  rule  also  computes  the  integral  of  a  cubic  function  /(x)  =  ax^  +  hx^  +  cx  +  d 
exactly.  Show  this  is  true  by  showing  that 

fix)  dx  =  ^\=^(/(xo)  +  4/((xo  +  X2)/2)  +  /(X2)). 

This  does  require  a  bit  of  messy  algebra,  so  you  may  prefer  to  use  Sage. 
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8.7    Additional  exercises 

These  problems  require  the  techniques  of  this  chapter,  and  are  in  no  particular  order.  Some 
problems  may  be  done  in  more  than  one  way. 

1.    j{t  +  Afdt^  2.  j  t{t^ -9f'^  dt^ 

3.    j  {e^  +  16)te^  dt  ^  4.  J  smtcos2t  dt  =^ 

f  2  [    2t  +  l 

5.     /  tantsec  tdt  6.  /  —  dt 

J  J  t^+t  +  3 

T.     /    ,  dt  ^  8.  — — ^ ,,,,,  dt 


J  Ve^Tl 


sec^  t 


15-     /  Tz,  r^dt 

(l  +  tant)3 


23.     I  sin^tcos'^  tdt  ^  24.    /-  dt 

t^  -6t  +  9 


1 


27.     /  t^e  dt  28.    /         '  ^  dt 

t^  +t-l 


t(t2-4)  ■    J  (25-t2)3/ 

,     cos  3t     ,  f         2  , 

9.     /  dt  ^  10.        t sec  tdt 

V  sin  3t  J 

11.     /  dt  ^  12.     /  cos'^tdt  = 


13.     /        —  dt  14.     /   .  ^  dt 


t^+3t  J  t^./TTt^ 


16.   J  t^\/¥^dt  =^ 

Jesintdt^  18.   j  {t^'"^  +  Alf^ftdt 

t'         ..  „„      /■  1 


17. 


21.     /  ^^^^^dt^  22.     /  .  ! 


4t2  /  t2  +  2t  -  3 


1 


25-    /  J^dt^  26.    /  t(lnt)2dt 


t+1 


9 

Applications  of  Integration 


9.1    Area  between  curves 

We  have  seen  how  integration  can  be  used  to  find  an  area  between  a  curve  and  the  x-axis. 
With  very  httle  change  we  can  find  some  areas  between  curves;  indeed,  the  area  between 
a  curve  and  the  x-axis  may  be  interpreted  as  the  area  between  the  curve  and  a  second 
"curve"  with  equation  y  =  0.  In  the  simplest  of  cases,  the  idea  is  quite  easy  to  understand. 

EXAMPLE  9.1.1  Find  the  area  below  f{x)  =  -x^  +  4x  +  3  and  above  g{x)  =  -x^  + 
7x^  —  lOx  +  5  over  the  interval  1  <  x  <  2.  In  figure  9.1.1  we  show  the  two  curves  together, 
with  the  desired  area  shaded,  then  /  alone  with  the  area  under  /  shaded,  and  then  g  alone 
with  the  area  under  g  shaded. 


Figure  9.1.1     Area  between  curves  as  a  difference  of  areas. 
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It  is  clear  from  the  figure  tliat  the  area  we  want  is  the  area  under  /  minus  the  area 
under  g,  which  is  to  say 


/2  /'2  /'2 

f{x)dx~  /   g{x)  dx  =  /  f{x)—g{x)dx. 

It  doesn't  matter  whether  we  compute  the  two  integrals  on  the  left  and  then  subtract  or 
compute  the  single  integral  on  the  right.  In  this  case,  the  latter  is  perhaps  a  bit  easier: 

2  /•2 

f{x)  —  g{x)  dx  =  I    —x"^  +  4x  +  3  —  {—x^  +       —  lOx  +  5)  dx 

2 

x^  -  8x^  +  14x  -2dx 

2 


1 


1 


x^     8x^     „  9 

 —  +  7x^-2x 

4  3 

16     64  ^      1      8     „  , 

^-y+28-4-(---+7-2) 

3      4  12 


□ 


It  is  worth  examining  this  problem  a  bit  more.  We  have  seen  one  way  to  look  at  it, 
by  viewing  the  desired  area  as  a  big  area  minus  a  small  area,  which  leads  naturally  to  the 
difference  between  two  integrals.  But  it  is  instructive  to  consider  how  we  might  find  the 
desired  area  directly.  We  can  approximate  the  area  by  dividing  the  area  into  thin  sections 
and  approximating  the  area  of  each  section  by  a  rectangle,  as  indicated  in  figure  9.1.2. 
The  area  of  a  typical  rectangle  is  Ax{f{xi)  —  g{xi)),  so  the  total  area  is  approximately 

n-l 

^{f{xi)  -g{xi))Ax. 

i=0 

This  is  exactly  the  sort  of  sum  that  turns  into  an  integral  in  the  limit,  namely  the  integral 

f2 

f{x)  -  g{x)  dx. 


1 

Of  course,  this  is  the  integral  we  actually  computed  above,  but  we  have  now  arrived  at  it 
directly  rather  than  as  a  modification  of  the  difference  between  two  other  integrals.  In  that 
example  it  really  doesn't  matter  which  approach  we  take,  but  in  some  cases  this  second 
approach  is  better. 


9.1    Area  between  curves  193 


10  n 


5 


0 


0 


1 


2 


3 


Figure  9.1.2     Approximating  area  between  curves  with  rectangles. 


EXAMPLE  9.1.2  Find  the  area  below  f{x)  =  -x^  +  4a;  +  1  and  above  g{x)  =  -x^  + 
7x^  —  lOx  +  3  over  the  interval  1  <  x  <  2;  these  are  the  same  curves  as  before  but  lowered 
by  2.  In  figure  9.1.3  we  show  the  two  curves  together.  Note  that  the  lower  curve  now  dips 
below  the  x-axis.  This  makes  it  somewhat  tricky  to  view  the  desired  area  as  a  big  area 
minus  a  smaller  area,  but  it  is  just  as  easy  as  before  to  think  of  approximating  the  area 
by  rectangles.  The  height  of  a  typical  rectangle  will  still  be  /(x^)  —  g{xi),  even  if  g{xi)  is 
negative.  Thus  the  area  is 


This  is  of  course  the  same  integral  as  before,  because  the  region  between  the  curves  is 


identical  to  the  former  region — it  has  just  been  moved  down  by  2. 


□ 


y 


10  - 


0 


5  - 


0 


Figure  9.1.3     Area  between  curves. 


EXAMPLE  9.1.3  Find  the  area  between  /(x)  =  — x^  +  4x  and  g{x)  =  x^  —  6x  +  5  over 
the  interval  0  <  x  <  1;  the  curves  are  shown  in  figure  9.1.4.  Generally  we  should  interpret 
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"area"  in  the  usual  sense,  as  a  necessarily  positive  quantity.  Since  the  two  curves  cross, 
we  need  to  compute  two  areas  and  add  them.  First  we  find  the  intersection  point  of  the 
curves: 

—x^  +  4x  =      —  6x  +  5 

0  =       -  lOx  +  5 

10  ±  VlOO  -  40      5  ±v^ 

X  =    =   . 

4  2 

The  intersection  point  we  want  is  x  =  a  =  (5  —  -\/l5)/2.  Then  the  total  area  is 

•  1 

x^  —  6x  +  5  —  (— x^  +  4x)  dx  +  I 

'0 


X  —  6x  +  5  — (— X  +  Ax)  dx  +  I    —X  +  4x  —  (x  —  6x  +  5)  dx 

2x^  -  lOx  +  5  cix  +  / 

'0 


=  /    2x2  -  lOx  +  5  cix  +  /    -2x2  +  lOx  -  5  dx 


2x^ 
52 


—  5x  +  5x 


+ 


2x^ 


+  5x  —  5x 


after  a  bit  of  simplification. 


□ 


Figure  9.1.4     Area  between  curves  that  cross. 


EXAMPLE  9.1.4  Find  the  area  between  /(x)  =  — x^  +  4x  and  g{x)  =  x^  —  6x  +  5; 
the  curves  are  shown  in  figure  9.1.5.  Here  we  are  not  given  a  specific  interval,  so  it  must 
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be  the  case  that  there  is  a  "natural"  region  involved.  Since  the  curves  are  both  parabolas, 
the  only  reasonable  interpretation  is  the  region  between  the  two  intersection  points,  which 
we  found  in  the  previous  example: 

5±a/I5 


If  we  let  a  =  (5  —  vl5)/2  and  6  =  (5  +  v  15)/2,  the  total  area  is 


/   —x'^  +  Ax  —  (x^  —  6x  +  5)  dx  =  /   —2x'^  +  lOx  —  5dx 

J  a  J  a 


3" 
5\/l5. 


+  5x  —  5x 


after  a  bit  of  simplification. 


□ 


Exercises  9.1. 

Find  the  area  bounded  by  the  curves. 

—      and  y  =      (the  part  to  the  right  of  the  y-axis) 
and  x  =  y'^  ^ 
1  —  y   and  y  =  —x  —  1  =^ 
3y  —      and  x  +  y  =  3  =^ 

cos(7ra;/2)  and  y  =  1  —  a;^  (in  the  first  quadrant)  =^ 
sin(7r2;/3)  and  y  =  x  (in  the  first  quadrant) 
y/x  and  y  =  =^> 
\/x  and  y  =  \/x  +  1,  0  <  a;  <  4 
0  and  a;  =  25  —  y^  =^ 
sin  X  cos  X  and  y  =  sinx,  0  <  x  <  tt  =^ 


1. 

y 

2. 

X 

3. 

X 

4. 

X 

5. 

y 

6. 

y 

7. 

y 

8. 

y 

9. 

X 

10. 

y 
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11  3/2         1  2/3 

11.  y  =  X  '    and  y  =  x  ' 

12.  y  =  x'^  —  2x  and  y  =  x 


The  following  three  exercises  expand  on  the  geometric  interpretation  of  the  hyperbolic  functions. 
Refer  to  section  4.11  and  particularly  to  figure  4.11.2  and  exercise  6  in  section  4.11. 

.3.  Co.p.e  /  ..e  ........  .  .  o..  .  .  e,e...  . 

in  section  4.11. 


14.  Fix  t  >  0.  Sketch  the  region  R  in  the  right  half  plane  bounded  by  the  curves  y  =  xtanht, 
y  =  — xtanht,  and  x^  —  y"^  = 

15.  Prove  that  the  area  of  i?  is 


y  =  — xtanht,  and  x^  —  y^  =  1.  Note  well:  t  is  fixed,  the  plane  is  the  x-y  plane. 


9.2    Distance,  Velocity,  Acceleration 

We  next  recall  a  general  principle  that  will  later  be  applied  to  distance- velocity-acceleration 
problems,  among  other  things.  If  F{u)  is  an  anti-derivative  of  /(w),  then  /   f{u)du  = 

J  a 

F{h)  —  F{a).  Suppose  that  we  want  to  let  the  upper  limit  of  integration  vary,  i.e.,  we 
replace  b  by  some  variable  x.  We  think  of  a  as  a  fixed  starting  value  xq.  In  this  new 
notation  the  last  equation  (after  adding  F{a)  to  both  sides)  becomes: 


F{x)  =  F{xo)  +  I  f{u)du. 

J  xo 


(Here  u  is  the  variable  of  integration,  called  a  "dummy  variable,"  since  it  is  not  the  variable 
in  the  function  F[x).  In  general,  it  is  not  a  good  idea  to  use  the  same  letter  as  a  variable 

.X 

of  integration  and  as  a  limit  of  integration.  That  is,  /    f{x)dx  is  bad  notation,  and  can 

J  Xo 

lead  to  errors  and  confusion.) 

An  important  application  of  this  principle  occurs  when  we  are  interested  in  the  position 
of  an  object  at  time  t  (say,  on  the  x-axis)  and  we  know  its  position  at  time  to-  Let  s{t) 
denote  the  position  of  the  object  at  time  t  (its  distance  from  a  reference  point,  such  as 
the  origin  on  the  x-axis).  Then  the  net  change  in  position  between  to  and  t  is  s{t)  —  s{to). 
Since  s{t)  is  an  anti-derivative  of  the  velocity  function  v{t),  we  can  write 

s{t)  =  s{to)  +  /  v{u)du. 


0 


Similarly,  since  the  velocity  is  an  anti-derivative  of  the  acceleration  function  a(t),  we  have 


v{t)  =  f (to)  +  /  a{u)du. 


0 
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EXAMPLE  9.2.1  Suppose  an  object  is  acted  upon  by  a  constant  force  F.  Find  v{t) 
and  s{t).  By  Newton's  law  F  =  ma,  so  the  acceleration  is  F/m,  where  m  is  the  mass  of 
the  object.  Then  we  first  have 


=  Vq-\  {t-  to), 

m 


r  F  F 
v(t)  =  v(to)  +      —  du  =  vo^  

J  to  ™  ™ 
using  the  usual  convention  vq  =  v{to).  Then 

s{t)  =  s{to)  +  [    (vq+  —{u-  to)]  du^  So+  {vqU  +  7r^(w  -  to)^) 


to 


=  so  +  ^^o(^-^o)  +  ;^(^-^o)^- 
Am 

For  instance,  when  F/m  =  —g  is  the  constant  of  gravitational  acceleration,  then  this  is 
the  falling  body  formula  (if  we  neglect  air  resistance)  familiar  from  elementary  physics: 

so+vo{t-to)  -  |(^-^o)^ 


or  in  the  common  case  that  to  =  0, 


So  +  vot  - 


u 


Recall  that  the  integral  of  the  velocity  function  gives  the  net  distance  traveled.  If  you 
want  to  know  the  total  distance  traveled,  you  must  find  out  where  the  velocity  function 
crosses  the  t-axis,  integrate  separately  over  the  time  intervals  when  v{t)  is  positive  and 
when  v{t)  is  negative,  and  add  up  the  absolute  values  of  the  different  integrals.  For 
example,  if  an  object  is  thrown  straight  upward  at  19.6  m/sec,  its  velocity  function  is 
v{t)  =  —9.8t+  19.6,  using  =  9.8  m/sec  for  the  force  of  gravity.  This  is  a  straight  line 
which  is  positive  for  t  <  2  and  negative  for  t  >  2.  The  net  distance  traveled  in  the  first  4 
seconds  is  thus 


/ 

^0 


(-9.8t  +  19.6)dt  =  0, 


while  the  total  distance  traveled  in  the  first  4  seconds  is 


f 

Jo 


{-9.8t+19.6)dt  + 


(-9.8t+ 19.6)dt 


=  19.6+1  -  19.61  =  39.2 


meters,  19.6  meters  up  and  19.6  meters  down. 
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EXAMPLE  9.2.2    The  acceleration  of  an  object  is  given  by  a{t)  =  cos{nt),  and  its 
velocity  at  time  t  =  0  is  1/  {2n) .  Find  both  the  net  and  the  total  distance  traveled  in  the 
first  1.5  seconds. 
We  compute 


v{t)  =  v{0)  +  /  cos{TTu)du 


0 


1  1     •    .  X 

 h  —  sminu) 


The  net  distance  traveled  is  then 


s(3/2)  -  s(0) 


0 


TX 


+  sin(7rt)  )  dt 


1ft  1 

-    -  -  -  cos(7rt) 

TT   \  Z  TV 


3/2 


3  1 


4n 


71^ 


0.340  meters. 


To  find  the  total  distance  traveled,  we  need  to  know  when  (0.5  +  sin(7rt))  is  positive  and 
when  it  is  negative.  This  function  is  0  when  sin(7rt)  is  —0.5,  i.e.,  when  nt  =  7n/6,  IItt/G, 
etc.  The  value  rrt  =  77r/6,  i.e.,  t  =  7/6,  is  the  only  value  in  the  range  0  <  t  <  1.5.  Since 
v{t)  >  0  for  t  <  7/6  and  v{t)  <  0  for  t  >  7/6,  the  total  distance  traveled  is 


7/6 


1  /I 


TT 


+  sin(7rt)    dt  + 


3/2 


1  /I 


7/6 

1 


+  sin(7rt)  dt 


-  (^  +  -cos{7n/6)  +  - 
7r  yiz      TV  TV 


1(7  lV3  l\  1 
7r\12      n  2       n  n 


+ 


^  J-T7^  +  -  cos(77r/6) 

TT    4         iZ  TT 


L  1^ 
12  ^  n^' 


0.409  meters. 


□ 


Exercises  9.2. 

For  each  velocity  function  find  both  the  net  distance  and  the  total  distance  traveled  during  the 
indicated  time  interval  (graph  v{t)  to  determine  when  it's  positive  and  when  it's  negative): 

1.  v  =  cos(7ri),  0  <  t  <  2.5  ^ 

2.  v  =  -9.8t  +  49,  0  <  t  <  10  ^ 

3.  v  =  3{t  -  3)(t  -  1),  0  <  t  <  5  ^ 

4.  V  =  sin(7rt/3)  -t,0<t<l  ^ 

5.  An  object  is  shot  upwards  from  ground  level  with  an  initial  velocity  of  2  meters  per  second; 
it  is  subject  only  to  the  force  of  gravity  (no  air  resistance).  Find  its  maximum  altitude  and 
the  time  at  which  it  hits  the  ground.  =^ 

6.  An  object  is  shot  upwards  from  ground  level  with  an  initial  velocity  of  3  meters  per  second; 
it  is  subject  only  to  the  force  of  gravity  (no  air  resistance).  Find  its  maximum  altitude  and 
the  time  at  which  it  hits  the  ground. 
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7.  An  object  is  shot  upwards  from  ground  level  with  an  initial  velocity  of  100  meters  per  second; 
it  is  subject  only  to  the  force  of  gravity  (no  air  resistance).  Find  its  maximum  altitude  and 
the  time  at  which  it  hits  the  ground.  =^ 

8.  An  object  moves  along  a  straight  line  with  acceleration  given  by  a{t)  =  —  cos(t),  and  ,3(0)  =  1 
and  v(0)  =  0.  Find  the  maximum  distance  the  object  travels  from  zero,  and  find  its  maximum 
speed.  Describe  the  motion  of  the  object. 

9.  An  object  moves  along  a  straight  line  with  acceleration  given  by  a{t)  =  sin(7ri).  Assume  that 
when  t  =  0,  s{t)  =  v(t)  =  0.  Find  s{t),  v{t),  and  the  maximum  speed  of  the  object.  Describe 
the  motion  of  the  object.  =^> 

10.  An  object  moves  along  a  straight  line  with  acceleration  given  by  a{t)  =  1  +  sin(7rt).  Assume 
that  when  t  =  0,  s{t)  =  v{t)  =  0.  Find  s{t)  and  v{t).  =^ 

11.  An  object  moves  along  a  straight  line  with  acceleration  given  by  a{t)  =  1  —  sin(7ri).  Assume 
that  when  t  =  0,  s{t)  =  v{t)  =  0.  Find  s{t)  and  v{t).  =^ 

9.3  Volume 

We  have  seen  how  to  compute  certain  areas  by  using  integration;  some  volumes  may  also 
be  computed  by  evaluating  an  integral.  Generally,  the  volumes  that  we  can  compute  this 
way  have  cross-sections  that  are  easy  to  describe. 


Figure  9.3.1     Volume  of  a  pyramid  approximated  by  rectangular  prisms.  (AP) 

EXAMPLE  9.3.1  Find  the  volume  of  a  pyramid  with  a  square  base  that  is  20  meters 
tall  and  20  meters  on  a  side  at  the  base.  As  with  most  of  our  applications  of  integration,  we 
begin  by  asking  how  we  might  approximate  the  volume.  Since  we  can  easily  compute  the 
volume  of  a  rectangular  prism  (that  is,  a  "box"),  we  will  use  some  boxes  to  approximate 
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the  volume  of  the  pyramid,  as  shown  in  figure  9.3.1:  on  the  left  is  a  cross-sectional  view,  on 
the  right  is  a  3D  view  of  part  of  the  pyramid  with  some  of  the  boxes  used  to  approximate 
the  volume. 

Each  box  has  volume  of  the  form  {2xi){2xi)Ay.  Unfortunately,  there  are  two  variables 
here;  fortunately,  we  can  write  x  in  terms  of  y:  x  =  10  —  y/2  or  =  10  —  yi/2.  Then  the 
total  volume  is  approximately 

n-l 


^4(10-2/,/2)2Ay 


i=0 


and  in  the  limit  we  get  the  volume  as  the  value  of  an  integral: 


20 


4{10-y/2fdy 


20 


{20 -y)^dy 


(20  -  yf 


20 


03 

3 


20^ 

IT 


8000 


As  you  may  know,  the  volume  of  a  pyramid  is  (1/3)  (height)  (area  of  base)  =  (l/3)(20)(400), 
which  agrees  with  our  answer.  □ 

EXAMPLE  9.3.2  The  base  of  a  solid  is  the  region  between  f{x)  =  x^  —  1  and  g{x)  = 
— +  1,  and  its  cross-sections  perpendicular  to  the  x-axis  are  equilateral  triangles,  as 
indicated  in  figure  9.3.2.  The  solid  has  been  truncated  to  show  a  triangular  cross-section 
above  x  =  1/2.  Find  the  volume  of  the  solid. 


Figure  9.3.2     Solid  with  equilateral  triangles  as  cross-sections.  (AP) 

A  cross-section  at  a  value  Xi  on  the  x-axis  is  a  triangle  with  base  2(1  —  x^)  and  height 
\/3(l  —  x^),  so  the  area  of  the  cross-section  is 


-(base) (height)  =  (1  -  x^)v^(l  - 
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and  the  volume  of  a  thin  "slab"  is  then 

(1  -x?)y3(l  -a;^)Aa;. 

Thus  the  total  volume  is 

r  v^(l-x2)2rfx=^y3. 

i-1  15  ^ 

One  easy  way  to  get  "nice"  cross-sections  is  by  rotating  a  plane  figure  around  a  line. 
For  example,  in  figure  9.3.3  we  see  a  plane  region  under  a  curve  and  between  two  vertical 
lines;  then  the  result  of  rotating  this  around  the  x-axis,  and  a  typical  circular  cross-section. 


Figure  9.3.3     A  solid  of  rotation. 


Of  course  a  real  "slice"  of  this  figure  will  not  have  straight  sides,  but  we  can  approxi- 
mate the  volume  of  the  slice  by  a  cylinder  or  disk  with  circular  top  and  bottom  and  straight 
sides;  the  volume  of  this  disk  will  have  the  form  Tir^  Ax.  As  long  as  we  can  write  r  in  terms 
of  X  we  can  compute  the  volume  by  an  integral. 

EXAMPLE  9.3.3  Find  the  volume  of  a  right  circular  cone  with  base  radius  10  and 
height  20.  (A  right  circular  cone  is  one  with  a  circular  base  and  with  the  tip  of  the  cone 
directly  over  the  center  of  the  base.)  We  can  view  this  cone  as  produced  by  the  rotation 
of  the  line  y  =  x/2  rotated  about  the  indicated  in  figure  9.3.4. 

At  a  particular  point  on  the  x-axis,  say  x^,  the  radius  of  the  resulting  cone  is  the 
y-coordinate  of  the  corresponding  point  on  the  line,  namely  yi  =  Xi/2.  Thus  the  total 
volume  is  approximately 

n-l 

^7r(x,/2)2dx 

and  the  exact  volume  is 

f^^    x2  ,       7r20^  20007r 
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Figure  9.3.4     Approximating  the  volume  of  a  cone  by  circular  disks. 
Note  that  we  can  instead  do  the  calculation  with  a  generic  height  and  radius: 


h  ^2 

TT  — dx 


7rr^ 


nr  h 


giving  us  the  usual  formula  for  the  volume  of  a  cone. 


□ 


EXAMPLE  9.3.4  Find  the  volume  of  the  object  generated  when  the  area  between 
y  =  and  y  =  x  is  rotated  around  the  x-axis.  This  solid  has  a  "hole"  in  the  middle;  we 
can  compute  the  volume  by  subtracting  the  volume  of  the  hole  from  the  volume  enclosed 
by  the  outer  surface  of  the  solid.  In  figure  9.3.5  we  show  the  region  that  is  rotated,  the 
resulting  solid  with  the  front  half  cut  away,  the  cone  that  forms  the  outer  surface,  the 
horn-shaped  hole,  and  a  cross-section  perpendicular  to  the  x-axis. 

We  have  already  computed  the  volume  of  a  cone;  in  this  case  it  is  7r/3.  At  a  particular 
value  of  X,  say  x^,  the  cross-section  of  the  horn  is  a  circle  with  radius  x^,  so  the  volume  of 
the  horn  is 

/   Txix'^Y  dx  =  I    7rx^(ix  =  7r-, 
Jo  Jo  5 

so  the  desired  volume  is  tt/3  —  tv/5  =  27r/15. 

As  with  the  area  between  curves,  there  is  an  alternate  approach  that  computes  the 
desired  volume  "all  at  once"  by  approximating  the  volume  of  the  actual  solid.  We  can 
approximate  the  volume  of  a  slice  of  the  solid  with  a  washer-shaped  volume,  as  indicated 
in  figure  9.3.5. 

The  volume  of  such  a  washer  is  the  area  of  the  face  times  the  thickness.  The  thickness, 
as  usual,  is  Ax,  while  the  area  of  the  face  is  the  area  of  the  outer  circle  minus  the  area  of 
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Figure  9.3.5     Solid  witli  a  iiole,  showing  the  outer  cone  and  the  shape  to  be  removed  to 
form  the  hole.  (AP) 

the  inner  circle,  say  ttR^  —  yrr^.  In  the  present  example,  at  a  particular  Xi,  the  radius  R  is 
Xi  and  r  is  a;?  Hence,  the  whole  volume  is 


TTX^  —  TTx'*  dx  =  n 


X 

y 


X 

~5 


TV 


1 


2n 
15' 


Of  course,  what  we  have  done  here  is  exactly  the  same  calculation  as  before,  except  we 
have  in  effect  recomputed  the  volume  of  the  outer  cone.  □ 

Suppose  the  region  between  f{x)  =  x  +  1  and  g{x)  =  (x  —  1)^  is  rotated  around  the 
y-axis;  see  figure  9.3.6.  It  is  possible,  but  inconvenient,  to  compute  the  volume  of  the 
resulting  solid  by  the  method  we  have  used  so  far.  The  problem  is  that  there  are  two 
"kinds"  of  typical  rectangles:  those  that  go  from  the  line  to  the  parabola  and  those  that 
touch  the  parabola  on  both  ends.  To  compute  the  volume  using  this  approach,  we  need  to 
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break  the  problem  into  two  parts  and  compute  two  integrals: 

/"I  /"4  g  27 

TT      (1  +  7^)2  -  (1  -         dy  +  n  J^{1  +  -{y-  if  dy  =  -n  + —n  =  —tt. 

If  instead  we  consider  a  typical  vertical  rectangle,  but  still  rotate  around  the  y-axis,  we 
get  a  thin  "shell"  instead  of  a  thin  "washer" .  If  we  add  up  the  volume  of  such  thin  shells 
we  will  get  an  approximation  to  the  true  volume.  What  is  the  volume  of  such  a  shell? 
Consider  the  shell  at  Xi.  Imagine  that  we  cut  the  shell  vertically  in  one  place  and  "unroll" 
it  into  a  thin,  flat  sheet.  This  sheet  will  be  almost  a  rectangular  prism  that  is  Ax  thick, 
f{xi)  —  g{xi)  tall,  and  27rxi  wide  (namely,  the  circumference  of  the  shell  before  it  was 
unrolled).  The  volume  will  then  be  approximately  the  volume  of  a  rectangular  prism  with 
these  dimensions:  2Txxi{f{xi)  —  g{xi))^x.  If  we  add  these  up  and  take  the  limit  as  usual, 
we  get  the  integral 

|.3  /.3  27 

/    2'iTx{f{x)—g{x))dx=  /    2nx{x  +  1  —  {x  —  1)'^)  dx  =  —n. 
Jo  Jo  2 

Not  only  does  this  accomplish  the  task  with  only  one  integral,  the  integral  is  somewhat 
easier  than  those  in  the  previous  calculation.  Things  are  not  always  so  neat,  but  it  is 
often  the  case  that  one  of  the  two  methods  will  be  simpler  than  the  other,  so  it  is  worth 
considering  both  before  starting  to  do  calculations. 


0       1       2       3  0       1       2  3 


Figure  9.3.6     Computing  volumes  with  "shells".  (AP) 


EXAMPLE  9.3.5  Suppose  the  area  under  y  =  —x^  +  1  between  x  =  0  and  x  =  1  is 
rotated  around  the  x-axis.  Find  the  volume  by  both  methods. 

/•I  g 

Disk  method:    /    7r(l  —  x^)^  dx  =  — tt. 

./n     ^  ^  15 


Shell  method:    /    2nyy^l  —  ydy 
Jo 


 TT.  □ 

15 
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Exercises  9.3. 


Verify  that  tt^  (l  +  ^f-^l-          dy  +  ir      (i  +  _  (y  _  i) 


8        65  27 

—  TT  H  TT  =   71. 

3        6  2 


27 

2.  Verify  that  /    27rx(x  +  1  -  (x  -  1)^)  da;  =  —it. 

Jo  2 


3.  Verify  that  /  7r(l  —  a;  )  dx  =  —tt 

Jo  15 

4.  Verify  that  /  27ry\/l  —  ydy  =  — tt. 

^0 


0  15 

5.  Use  integration  to  find  the  volume  of  the  sohd  obtained  by  revolving  the  region  bounded  by 
X  +  y  =  2  and  the  x  and  y  axes  around  the  x-axis.  =^> 

6.  Find  the  volume  of  the  solid  obtained  by  revolving  the  region  bounded  by  y  =  x  —  a;'^  and 
the  a;-axis  around  the  x-axis.  =^> 

7.  Find  the  volume  of  the  solid  obtained  by  revolving  the  region  bounded  by  y  =  V sinx  between 
X  =  0  and  x  =  7r/2,  the  y-axis,  and  the  line  y  =  1  around  the  x-axis.  =r- 

8.  Let  5  be  the  region  of  the  xy-plane  bounded  above  by  the  curve  x'^y  =  64,  below  by  the  line 
y  =  1,  on  the  left  by  the  line  x  =  2,  and  on  the  right  by  the  line  x  =  4.  Find  the  volume  of 
the  solid  obtained  by  rotating  S  around  (a)  the  x-axis,  (b)  the  line  y  =  1,  (c)  the  y-axis,  (d) 
the  line  x  =  2.  =^ 

9.  The  equation  x'^/O  +  y'^/4  =  1  describes  an  ellipse.  Find  the  volume  of  the  solid  obtained 
by  rotating  the  ellipse  around  the  x-axis  and  also  around  the  y-axis.  These  solids  are  called 
ellipsoids;  one  is  vaguely  rugby-ball  shaped,  one  is  sort  of  flying-saucer  shaped,  or  perhaps 
squished-beach-ball-shaped.  =^ 


Figure  9.3.7     Ellipsoids.  (AP) 

10.  Use  integration  to  compute  the  volume  of  a  sphere  of  radius  r.  You  should  of  course  get  the 
well-known  formula  47rr^/3. 

11.  A  hemispheric  bowl  of  radius  r  contains  water  to  a  depth  h.  Find  the  volume  of  water  in  the 
bowl.  =f- 

12.  The  base  of  a  tetrahedron  (a  triangular  pyramid)  of  height  h  is  an  equilateral  triangle  of  side 
s.  Its  cross-sections  perpendicular  to  an  altitude  are  equilateral  triangles.  Express  its  volume 
V  as  an  integral,  and  find  a  formula  for  V  in  terms  of  h  and  s.  Verify  that  your  answer  is 
(1/3) (area  of  base) (height). 

13.  The  base  of  a  solid  is  the  region  between  /(x)  =  cosx  and  g{x)  =  —  cosx,  — 7r/2  <  x  <  7r/2, 
and  its  cross-sections  perpendicular  to  the  x-axis  are  squares.  Find  the  volume  of  the  solid. 
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9.4    Average  value  of  a  function 

The  average  of  some  finite  set  of  values  is  a  familiar  concept.  If,  for  example,  the  class 
scores  on  a  quiz  are  10,  9,  10,  8,  7,  5,  7,  6,  3,  2,  7,  8,  then  the  average  score  is  the  sum  of 
these  numbers  divided  by  the  size  of  the  class: 

10  +  9  +  10  +  8  +  7  +  5  +  7  +  6  +  3  +  2  +  7  +  8  82 

average  score  =  =  —  ?s  6.83. 

^  12  12 

Suppose  that  between  t  =  0  and  t  =  1  the  speed  of  an  object  is  sin(7rt).  What  is  the 
average  speed  of  the  object  over  that  time?  The  question  sounds  as  if  it  must  make  sense, 
yet  we  can't  merely  add  up  some  number  of  speeds  and  divide,  since  the  speed  is  changing 
continuously  over  the  time  interval. 

To  make  sense  of  "average"  in  this  context,  we  fall  back  on  the  idea  of  approximation. 
Consider  the  speed  of  the  object  at  tenth  of  a  second  intervals:  sinO,  sin(0.l7r),  sin(0.27r), 
sin(0.37r),. . . ,  sin(0.97r).  The  average  speed  "should"  be  fairly  close  to  the  average  of  these 
ten  speeds: 


1  ' 


^^^sin(7rVlO)^  ^6.3  =  0.63. 

i=0 


Of  course,  if  we  compute  more  speeds  at  more  times,  the  average  of  these  speeds  should 
be  closer  to  the  "real"  average.  If  we  take  the  average  of  n  speeds  at  evenly  spaced  times, 
we  get: 


^  n—L 

-  J]sin(7rVn). 


i=0 


n-1 

n 

Here  the  individual  times  are  ti  =  i/n,  so  rewriting  slightly  we  have 

n-1 

n 


^  n—l 

Esin(7rtj). 


i=0 


This  is  almost  the  sort  of  sum  that  we  know  turns  into  an  integral;  what's  apparently 
missing  is  At — but  in  fact.  At  —  1/n,  the  length  of  each  subinterval.  So  rewriting  again: 


n— 1  ^  n—l 


^^sin(7rtj)—  =       sin(7rtj)  At. 


1=0  ^  i=0 


Now  this  has  exactly  the  right  form,  so  that  in  the  limit  we  get 

cos(7rt) 


,            .  r    \  1  cos(7i 
average  speed  =  /    sm(7rt j  at  =  


^        cos(7r)     cos(O)  2 
=  ^  +  — ^  =  -     0.6366  0.64. 

Q  TT  TT  TT 


It's  not  entirely  obvious  from  this  one  simple  example  how  to  compute  such  an  average 
in  general.  Let's  look  at  a  somewhat  more  complicated  case.  Suppose  that  the  velocity 
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of  an  object  is  16t^  +  5  feet  per  second.  What  is  the  average  velocity  between  t 
t  =  3?  Again  we  set  up  an  approximation  to  the  average: 


1  and 


^  n—  1 

where  the  values  ti  are  evenly  spaced  times  between  1  and  3.  Once  again  we  are  "missing" 
At,  and  this  time  1/n  is  not  the  correct  value.  What  is  At  in  general?  It  is  the  length  of 
a  subinterval;  in  this  case  we  take  the  interval  [1,3]  and  divide  it  into  n  subintervals,  so 
each  has  length  {3  —  l)/n  =  2/n  =  At.  Now  with  the  usual  "multiply  and  divide  by  the 
same  thing"  trick  we  can  rewrite  the  sum: 

^   n  — 1  ^       n—1  Q       1         1         1  „         ^  n—1 

i=0  i=0  i=0  1=0 

In  the  limit  this  becomes 


1 


+  5dt 


1  446  _  223 
2ir  ~  IT' 


Does  this  seem  reasonable?  Let's  picture  it:  in  figure  9.4.1  is  the  velocity  function  together 
with  the  horizontal  line  y  =  223/3  ~  74.3.  Certainly  the  height  of  the  horizontal  line  looks 
at  least  plausible  for  the  average  height  of  the  curve. 


150 


125 


100 


Figure  9.4.1     Average  velocity. 


Here's  another  way  to  interpret  "average"  that  may  make  our  computation  appear 
even  more  reasonable.  The  object  of  our  example  goes  a  certain  distance  between  t  =  1 
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and  t  =  3.  If  instead  the  object  were  to  travel  at  the  average  speed  over  the  same  time,  it 
should  go  the  same  distance.  At  an  average  speed  of  223/3  feet  per  second  for  two  seconds 
the  object  would  go  446/3  feet.  How  far  does  it  actually  go?  We  know  how  to  compute 
this: 

j    v{t)dt  =  j   16t2  +  5rft=— . 
So  now  we  see  that  another  interpretation  of  the  calculation 

1  5  ,  1  446  223 
-  /  +  5dt=  =   

2  ii  2  3  3 

is:  total  distance  traveled  divided  by  the  time  in  transit,  namely,  the  usual  interpretation 
of  average  speed. 

In  the  case  of  speed,  or  more  properly  velocity,  we  can  always  interpret  "average"  as 
total  (net)  distance  divided  by  time.  But  in  the  case  of  a  different  sort  of  quantity  this 
interpretation  does  not  obviously  apply,  while  the  approximation  approach  always  does. 
We  might  interpret  the  same  problem  geometrically:  what  is  the  average  height  of  16x^  +  5 
on  the  interval  [1,  3]?  We  approximate  this  in  exactly  the  same  way,  by  adding  up  many 
sample  heights  and  dividing  by  the  number  of  samples.  In  the  limit  we  get  the  same  result: 

1  2  1   /"^      2        ,       1  446  223 

lim  —  >    16x,  +  5  =  -  /    16a;  +  5  ax  =  =   . 

n^oo  '  2  2  3  3 

We  can  interpret  this  result  in  a  slightly  different  way.  The  area  under  y  =  16x^  +  5  above 
[1,3]  is 

9        ,  446 

The  area  under  y  =  223/3  over  the  same  interval  [1,3]  is  simply  the  area  of  a  rectangle 
that  is  2  by  223/3  with  area  446/3.  So  the  average  height  of  a  function  is  the  height  of  the 
horizontal  line  that  produces  the  same  area  over  the  given  interval. 

Exercises  9.4- 

1.  Find  the  average  height  of  cosx  over  the  intervals  [0,7r/2],  [— 7r/2, 7r/2],  and  [0,27r].  =^ 

2.  Find  the  average  height  of  x'^  over  the  interval  [—2,  2]. 

3.  Find  the  average  height  of  l/x'^  over  the  interval  [1,^]. 

4.  Find  the  average  height  of  \/l  —      over  the  interval  [—1,1]. 

5.  An  object  moves  with  velocity  v{t)  =  —t^  +  1  feet  per  second  between  t  =  0  and  t  =  2.  Find 
the  average  velocity  and  the  average  speed  of  the  object  between  t  =  0  and  t  =  2.  =^ 
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6.  The  observation  deck  on  the  102nd  floor  of  the  Empire  State  Building  is  1,224  feet  above 
the  ground.  If  a  steel  ball  is  dropped  from  the  observation  deck  its  velocity  at  time  t  is 
approximately  v(t)  =  —32t  feet  per  second.  Find  the  average  speed  between  the  time  it  is 
dropped  and  the  time  it  hits  the  ground,  and  find  its  speed  when  it  hits  the  ground.  =^ 


A  fundamental  concept  in  classical  physics  is  work:  If  an  object  is  moved  in  a  straight 
line  against  a  force  F  for  a  distance  s  the  work  done  is  W  =  Fs. 

EXAMPLE  9.5.1     How  much  work  is  done  in  lifting  a  10  pound  weight  vertically  a 
distance  of  5  feet?  The  force  due  to  gravity  on  a  10  pound  weight  is  10  pounds  at  the 
surface  of  the  earth,  and  it  does  not  change  appreciably  over  5  feet.  The  work  done  is 
=  10  ■  5  =  50  foot-pounds.  □ 

In  reality  few  situations  are  so  simple.  The  force  might  not  be  constant  over  the  range 
of  motion,  as  in  the  next  example. 

EXAMPLE  9.5.2  How  much  work  is  done  in  lifting  a  10  pound  weight  from  the  surface 
of  the  earth  to  an  orbit  100  miles  above  the  surface?  Over  100  miles  the  force  due  to  gravity 
does  change  significantly,  so  we  need  to  take  this  into  account.  The  force  exerted  on  a  10 
pound  weight  at  a  distance  r  from  the  center  of  the  earth  is  F  =  k/r"^  and  by  definition 
it  is  10  when  r  is  the  radius  of  the  earth  (we  assume  the  earth  is  a  sphere).  How  can  we 
approximate  the  work  done?  We  divide  the  path  from  the  surface  to  orbit  into  n  small 
subpaths.  On  each  subpath  the  force  due  to  gravity  is  roughly  constant,  with  value  /e/r^ 
at  distance  r^.  The  work  to  raise  the  object  from  to  r^+i  is  thus  approximately  /c/rf  Ar 
and  the  total  work  is  approximately 
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n-l 


or  in  the  limit 


where  tq  is  the  radius  of  the  earth  and  ri  is  ro  plus  100  miles.  The  work  is 


Using  ro  =  20925525  feet  we  have  ri  =  21453525.  The  force  on  the  10  pound  weight  at 
the  surface  of  the  earth  is  10  pounds,  so  10  =  A;/20925525^  giving  k  =  4378775965256250. 
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Then 

k      k      491052320000  ^ 

 1  =  ^  5150052  foot-pounds. 

ri     ro  95349 

Note  that  if  we  assume  the  force  due  to  gravity  is  10  pounds  over  the  whole  distance  we 
would  calculate  the  work  as  10(ri  —  ro)  =  10  •  100  •  5280  =  5280000,  somewhat  higher  since 
we  don't  account  for  the  weakening  of  the  gravitational  force.  □ 

EXAMPLE  9.5.3  How  much  work  is  done  in  lifting  a  10  kilogram  object  from  the 
surface  of  the  earth  to  a  distance  D  from  the  center  of  the  earth?  This  is  the  same 
problem  as  before  in  different  units,  and  we  are  not  specifying  a  value  for  D.  As  before 


l""  \dr  =  -- 


D 


ro 


k_  k_ 


While  "weight  in  pounds"  is  a  measure  of  force,  "weight  in  kilograms"  is  a  measure  of  mass. 
To  convert  to  force  we  need  to  use  Newton's  law  F  =  ma.  At  the  surface  of  the  earth  the 
acceleration  due  to  gravity  is  approximately  9.8  meters  per  second  squared,  so  the  force  is 
F  =  10-9.8  =  98.  The  units  here  are  "kilogram- meters  per  second  squared"  or  "kg  m/s^", 
also  known  as  a  Newton  (N),  so  -F  =  98  N.  The  radius  of  the  earth  is  approximately  6378.1 
kilometers  or  6378100  meters.  Now  the  problem  proceeds  as  before.  From  F  =  k/r^  we 
compute  k:  98  =  A;/6378100^  k  =  3.986655642  •  10^^.  Then  the  work  is: 

k 

W  ^  -—  +  6.250538000  ■  10^  Newton-meters. 

As  D  increases  W  of  course  gets  larger,  since  the  quantity  being  subtracted,  —k/D,  gets 
smaller.  But  note  that  the  work  W  will  never  exceed  6.250538000  •  10®,  and  in  fact  will 
approach  this  value  as  D  gets  larger.  In  short,  with  a  finite  amount  of  work,  namely 
6.250538000  •  10®  N-m,  we  can  lift  the  10  kilogram  object  as  far  as  we  wish  from  earth. 

□ 

Next  is  an  example  in  which  the  force  is  constant,  but  there  are  many  objects  moving 
different  distances. 


EXAMPLE  9.5.4  Suppose  that  a  water  tank  is  shaped  like  a  right  circular  cone  with 
the  tip  at  the  bottom,  and  has  height  10  meters  and  radius  2  meters  at  the  top.  If  the 
tank  is  full,  how  much  work  is  required  to  pump  all  the  water  out  over  the  top?  Here  we 
have  a  large  number  of  atoms  of  water  that  must  be  lifted  different  distances  to  get  to  the 
top  of  the  tank.  Fortunately,  we  don't  really  have  to  deal  with  individual  atoms — ^we  can 
consider  all  the  atoms  at  a  given  depth  together. 
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^  2  ^ 


10 


Figure  9.5.1     Cross-section  of  a  conical  water  tank. 


To  approximate  the  work,  we  can  divide  the  water  in  the  tank  into  horizontal  sections, 
approximate  the  volume  of  water  in  a  section  by  a  thin  disk,  and  compute  the  amount  of 
work  required  to  lift  each  disk  to  the  top  of  the  tank.  As  usual,  we  take  the  limit  as  the 
sections  get  thinner  and  thinner  to  get  the  total  work. 

At  depth  h  the  circular  cross-section  through  the  tank  has  radius  r  =  (10  —  h)/5,  by 
similar  triangles,  and  area  7r(10  — /i)^/25.  A  section  of  the  tank  at  depth  h  thus  has  volume 
approximately  7r(10  —  /i)^/25A/i  and  so  contains  an{10  —  /i)^/25A/i  kilograms  of  water, 
where  a  is  the  density  of  water  in  kilograms  per  cubic  meter;  a  ~  1000.  The  force  due  to 
gravity  on  this  much  water  is  9.8a7r(10  —  /i)^/25A/i,  and  finally,  this  section  of  water  must 
be  lifted  a  distance  h,  which  requires  /i9.8cr7r(10  —  /i)^/25A/j,  Newton-meters  of  work.  The 
total  work  is  therefore 


A  spring  has  a  "natural  length,"  its  length  if  nothing  is  stretching  or  compressing 
it.  If  the  spring  is  either  stretched  or  compressed  the  spring  provides  an  opposing  force; 
according  to  Hooke's  Law  the  magnitude  of  this  force  is  proportional  to  the  distance  the 
spring  has  been  stretched  or  compressed:  F  =  kx.  The  constant  of  proportionality,  k,  of 
course  depends  on  the  spring.  Note  that  x  here  represents  the  change  in  length  from  the 
natural  length. 

EXAMPLE  9.5.5  Suppose  A;  =  5  for  a  given  spring  that  has  a  natural  length  of  0.1 
meters.  Suppose  a  force  is  applied  that  compresses  the  spring  to  length  0.08.  What  is 
the  magnitude  of  the  force?  Assuming  that  the  constant  k  has  appropriate  dimensions 


Newton-meters. 


u 


(namely,  kg/s^),  the  force  is  5(0.1  -  0.08)  =  5(0.02)  =  0.1  Newtons. 


□ 
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EXAMPLE  9.5.6  How  much  work  is  done  in  compressing  the  spring  in  the  previous 
example  from  its  natural  length  to  0.08  meters?  From  0.08  meters  to  0.05  meters?  How 
much  work  is  done  to  stretch  the  spring  from  0.1  meters  to  0.15  meters?  We  can  approx- 
imate the  work  by  dividing  the  distance  that  the  spring  is  compressed  (or  stretched)  into 
small  subintervals.  Then  the  force  exerted  by  the  spring  is  approximately  constant  over  the 
subinterval,  so  the  work  required  to  compress  the  spring  from  Xi  to  Xi^i  is  approximately 
5{xi  —  0.1)Ax.  The  total  work  is  approximately 


n-l 


^5(xi  -0.1)Ax 


1=0 


and  in  the  limit 


r-0.08  p./    _  n  n2  0.08 

W=  I       5{x-0.1)dx=    ^  ' 
0.1  2 


5(0.08-0.1)2  5(0.1-0.1) 


1000 


N-m. 


The  other  values  we  seek  simply  use  different  limits.  To  compress  the  spring  from  0.08 
meters  to  0.05  meters  takes 


W 


0.05 


5(x  —  0.1)  dx 


hx^ 


0.08 


0.05 


0.08 


5(0.05-0.1)2  5(0.08-0.1) 


21 
4000 


N-m 


and  to  stretch  the  spring  from  0.1  meters  to  0.15  meters  requires 


W 


0.15 


h{x  —  0.1)  dx 


hx^ 


0.1 


0.15 


0.1 


5(0.15-0.1)2  5(0.1-0.1) 


160 


N-m. 


□ 


Exercises  9.5. 

1.  How  much  work  is  done  in  lifting  a  100  kilogram  weight  from  the  surface  of  the  earth  to  an 
orbit  35,786  kilometers  above  the  surface  of  the  earth?  =^ 

2.  How  much  work  is  done  in  lifting  a  100  kilogram  weight  from  an  orbit  1000  kilometers  above 
the  surface  of  the  earth  to  an  orbit  35,786  kilometers  above  the  surface  of  the  earth?  =^ 

3.  A  water  tank  has  the  shape  of  an  upright  cylinder  with  radius  r  =  1  meter  and  height  10 
meters.  If  the  depth  of  the  water  is  5  meters,  how  much  work  is  required  to  pump  all  the 
water  out  the  top  of  the  tank?  =^ 

4.  Suppose  the  tank  of  the  previous  problem  is  lying  on  its  side,  so  that  the  circular  ends  are 
vertical,  and  that  it  has  the  same  amount  of  water  as  before.  How  much  work  is  required 
to  pump  the  water  out  the  top  of  the  tank  (which  is  now  2  meters  above  the  bottom  of  the 
tank)?  ^ 
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5.  A  water  tank  has  the  shape  of  the  bottom  half  of  a  sphere  with  radius  r  =  1  meter.  If  the 
tank  is  full,  how  much  work  is  required  to  pump  all  the  water  out  the  top  of  the  tank? 

6.  A  spring  has  constant  A;  =  10  kg/s'^.  How  much  work  is  done  in  compressing  it  1/10  meter 
from  its  natural  length?  =^ 

7.  A  force  of  2  Newtons  will  compress  a  spring  from  1  meter  (its  natural  length)  to  0.8  meters. 
How  much  work  is  required  to  stretch  the  spring  from  1.1  meters  to  1.5  meters?  =^ 

8.  A  20  meter  long  steel  cable  has  density  2  kilograms  per  meter,  and  is  hanging  straight  down. 
How  much  work  is  required  to  lift  the  entire  cable  to  the  height  of  its  top  end?  =^> 

9.  The  cable  in  the  previous  problem  has  a  100  kilogram  bucket  of  concrete  attached  to  its  lower 
end.  How  much  work  is  required  to  lift  the  entire  cable  and  bucket  to  the  height  of  its  top 
end?  =^ 

10.  Consider  again  the  cable  and  bucket  of  the  previous  problem.  How  much  work  is  required 
to  lift  the  bucket  10  meters  by  raising  the  cable  10  meters?  (The  top  half  of  the  cable  ends 
up  at  the  height  of  the  top  end  of  the  cable,  while  the  bottom  half  of  the  cable  is  lifted  10 
meters.) 
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Suppose  a  beam  is  10  meters  long,  and  that  there  are  three  weights  on  the  beam:  a  10 
kilogram  weight  3  meters  from  the  left  end,  a  5  kilogram  weight  6  meters  from  the  left  end, 
and  a  4  kilogram  weight  8  meters  from  the  left  end.  Where  should  a  fulcrum  be  placed 
so  that  the  beam  balances?  Let's  assign  a  scale  to  the  beam,  from  0  at  the  left  end  to  10 
at  the  right,  so  that  we  can  denote  locations  on  the  beam  simply  as  x  coordinates;  the 
weights  are  at  x  =  3,  x  =  6,  and  x  =  8,  as  in  figure  9.6.1. 


10 

5 

4 

3 

A 

6 

8 

Figure  9.6.1     A  beam  with  three  masses. 

Suppose  to  begin  with  that  the  fulcrum  is  placed  at  x  =  5.  What  will  happen?  Each 
weight  applies  a  force  to  the  beam  that  tends  to  rotate  it  around  the  fulcrum;  this  effect 
is  measured  by  a  quantity  called  torque,  proportional  to  the  mass  times  the  distance 
from  the  fulcrum.  Of  course,  weights  on  different  sides  of  the  fulcrum  rotate  the  beam 
in  opposite  directions.  We  can  distinguish  this  by  using  a  signed  distance  in  the  formula 
for  torque.  So  with  the  fulcrum  at  5,  the  torques  induced  by  the  three  weights  will  be 
proportional  to  (3  —  5)10  =  —20,  (6  —  5)5  =  5,  and  (8  —  5)4  =  12.  For  the  beam  to 
balance,  the  sum  of  the  torques  must  be  zero;  since  the  sum  is  —20  +  5  +  12  =  —3,  the 
beam  rotates  counter-clockwise,  and  to  get  the  beam  to  balance  we  need  to  move  the 
fulcrum  to  the  left.  To  calculate  exactly  where  the  fulcrum  should  be,  we  let  x  denote  the 
location  of  the  fulcrum  when  the  beam  is  in  balance.  The  total  torque  on  the  beam  is  then 
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(3  —  a;)10  +  (6  —  x)5  +  (8  —  x)4  =  92  —  19x.  Since  the  beam  balances  at  x  it  must  be  that 
92  -  19x  =  0  or  X  =  92/19  4.84,  that  is,  the  fulcrum  should  be  placed  at  x  =  92/19  to 
balance  the  beam. 

Now  suppose  that  we  have  a  beam  with  varying  density — some  portions  of  the  beam 
contain  more  mass  than  other  portions  of  the  same  size.  We  want  to  figure  out  where  to 
put  the  fulcrum  so  that  the  beam  balances. 


mo 

mi 

m2 

ma 

m4 

ms 

me 

mj 

ms 

mg 

Figure  9.6.2     A  solid  beam. 


EXAMPLE  9.6.1  Suppose  the  beam  is  10  meters  long  and  that  the  density  is  1  +  a; 
kilograms  per  meter  at  location  x  on  the  beam.  To  approximate  the  solution,  we  can  think 
of  the  beam  as  a  sequence  of  weights  "on"  a  beam.  For  example,  we  can  think  of  the 
portion  of  the  beam  between  a;  =  0  and  x  =  1  as  a  weight  sitting  at  x  =  0,  the  portion 
between  x  =  1  and  a;  =  2  as  a  weight  sitting  at  x  =  1,  and  so  on,  as  indicated  in  figure  9.6.2. 
We  then  approximate  the  mass  of  the  weights  by  assuming  that  each  portion  of  the  beam 
has  constant  density.  So  the  mass  of  the  first  weight  is  approximately  mo  =  (1  +  0)1  =  1 
kilograms,  namely,  (1  +  0)  kilograms  per  meter  times  1  meter.  The  second  weight  is 
mi  =  (1  +  1)1  =  2  kilograms,  and  so  on  to  the  tenth  weight  with  mg  =  (1  +  9)1  =  10 
kilograms.  So  in  this  case  the  total  torque  is 

(0  -  x)mo  +  (1  -  x)mi  H  h  (9  -  x)m9  =  (0  -  x)l  +  (1  -  x)2  H  h  (9  -  x)10. 

If  we  set  this  to  zero  and  solve  for  x  we  get  x  =  6.  In  general,  if  we  divide  the  beam  into 
n  portions,  the  mass  of  weight  number  i  will  be  =  (1  +  Xi){xi+i  —  Xi)  =  (1  +  Xi)Ax 
and  the  torque  induced  by  weight  number  i  will  be  (xi  —  x)mi  =  {xi  —  x)(l  +  Xi)Ax.  The 
total  torque  is  then 

(xo  —  x)(l  +  xo)Ax  +  (xi  —  x)(l  +  xi)Ax  +  ■  ■  ■  +  (x^-i  —  x)(l  +  x„_i)Ax 

n— 1  n—1 
=         Xi{l  +  Xi)Ax  —         x(l  +  Xi)Ax 

i=0  i=0 

n—1  n—1 
=  Xi(l  +  Xj)Ax  —  X  ^^(1  +  Xi)Ax. 

i=0  j=0 
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If  we  set  this  equal  to  zero  and  solve  for  x  we  get  an  approximation  to  the  balance  point 
of  the  beam: 


n— 1  n— 1 

'^X 

n—l  n— 1 

X 

1=0  i=0 
n-1 


0  =  Xi{l  +  Xi)Ax  -  X  +  Xi)A2 

i=0 

n-1 

y^(l  +  Xi)Ax  =  ^  Xi{l  +  Xi)Ax 
i=0 
n-1 


1=0 

X 


^(1  +  a;i)Aa; 


i=0 

The  denominator  of  this  fraction  has  a  very  familiar  interpretation.  Consider  one  term  of 
the  sum  in  the  denominator:  (1  +  Xi)Ax.  This  is  the  density  near  Xi  times  a  short  length, 
Ax,  which  in  other  words  is  approximately  the  mass  of  the  beam  between  Xi  and  Xj+i. 
When  we  add  these  up  we  get  approximately  the  mass  of  the  beam. 

Now  each  of  the  sums  in  the  fraction  has  the  right  form  to  turn  into  an  integral,  which 
in  turn  gives  us  the  exact  value  of  x: 

/•lO 

/     x{l  +  x)  dx 
x=^' 


/    {1  +  x)  dx 
Jo 

The  numerator  of  this  fraction  is  called  the  moment  of  the  system  around  zero: 

/■'°  ^  ^        /"'^         2  ^  1150 

/     x{l  +  X)  dx  =        X  +  X  dx  - 

Jo  Jo 


3  ' 

and  the  denominator  is  the  mass  of  the  beam: 

10 

(1  +  x)dx^  60, 

0 

and  the  balance  point,  officially  called  the  center  of  mass,  is 

1^1  =  1^»  6.39. 
3    60  18 


□ 
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It  should  be  apparent  that  there  was  nothing  special  about  the  density  function  (t{x)  = 
1  +  a;  or  the  length  of  the  beam,  or  even  that  the  left  end  of  the  beam  is  at  the  origin. 
In  general,  if  the  density  of  the  beam  is  a{x)  and  the  beam  covers  the  interval  [a,  6],  the 
moment  of  the  beam  around  zero  is 


and  the  total  mass  of  the  beam  is 


M  =  j  a{x)  dx 

J  a 


and  the  center  of  mass  is  at 

_  Mo 


EXAMPLE  9.6.2  Suppose  a  beam  lies  on  the  a;-axis  between  20  and  30,  and  has  density 
function  a{x)  =  x  —  19.  Find  the  center  of  mass.  This  is  the  same  as  the  previous  example 
except  that  the  beam  has  been  moved.  Note  that  the  density  at  the  left  end  is  20  —  19  =  1 
and  at  the  right  end  is  30  —  19  =  11,  as  before.  Hence  the  center  of  mass  must  be  at 
approximately  20  +  6.39  =  26.39.  Let's  see  how  the  calculation  works  out. 


Mo 


M 


/.30 
/ 

J  20 


x{x  —  19)  dx 


20 
30 


30 


x'^  —  19x  dx 


20 


X 

y 


30 


20 


4750 


/■csu 
J  20 


19  dx  = 
4750  1  475 


X 


-  19x 


Mo  ^  

M        3  60 


18 


2 

26.39. 


30 


20 


60 


□ 


EXAMPLE  9.6.3  Suppose  a  fiat  plate  of  uniform  density  has  the  shape  contained  by 
y  =  x^,  y  =  1,  and  x  =  0,  in  the  first  quadrant.  Find  the  center  of  mass.  (Since  the  density 
is  constant,  the  center  of  mass  depends  only  on  the  shape  of  the  plate,  not  the  density,  or 
in  other  words,  this  is  a  purely  geometric  quantity.  In  such  a  case  the  center  of  mass  is 
called  the  centroid.) 

This  is  a  two  dimensional  problem,  but  it  can  be  solved  as  if  it  were  two  one  dimensional 
problems:  we  need  to  find  the  x  and  y  coordinates  of  the  center  of  mass,  x  and  y,  and 
fortunately  we  can  do  these  independently.  Imagine  looking  at  the  plate  edge  on,  from 
below  the  a;-axis.  The  plate  will  appear  to  be  a  beam,  and  the  mass  of  a  short  section 
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Figure  9.6.3     Center  of  mass  for  a  two  dimensional  plate. 


of  the  "beam",  say  between  Xi  and  is  the  mass  of  a  strip  of  the  plate  between  Xi 

and  Xi+i.  See  figure  9.6.3  showing  the  plate  from  above  and  as  it  appears  edge  on.  Since 
the  plate  has  uniform  density  we  may  as  well  assume  that  a  =  1.  Then  the  mass  of  the 


plate  between  Xi  and  Xi^i  is  approximately 
compute  the  moment  around  the  y-axis: 


Xa 


) Ax  =  (1  —  x1)^x.  Now  we  can 


x(l  —  x"^)  dx 


1 

4 


and  the  total  mass 


and  finally 


M=  f  {l-x'^)dx 
Jo 


13  3 


X 


2 
3 


42  8 

Next  we  do  the  same  thing  to  find  y.  The  mass  of  the  plate  between  yi  and  yi+i  is 
approximately  rii  =  i/yAy,  so 


M^=  I  yy/ydy 

Jo 


2 

5 


and 


23 
5  2 


3 

5' 


since  the  total  mass  M  is  the  same.  The  center  of  mass  is  shown  in  figure  9.6.3. 


□ 


EXAMPLE  9.6.4  Find  the  center  of  mass  of  a  thin,  uniform  plate  whose  shape  is  the 
region  between  y  =  cosx  and  the  x-axis  between  x  =  —n/2  and  x  =  7r/2.   It  is  clear 
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that  a;  =  0,  but  for  practice  let's  compute  it  anyway.  We  will  need  the  total  mass,  so  we 
compute  it  first: 

r/2  ^/2 


=  2. 

-7r/2 


M  =  cos  xdx  —  sin  x 

i-7r/2 

The  moment  around  the  y-axis  is 

"^/S  7r/2 

0 

r/2 

and  the  moment  around  the  x-axis  is 


-7r/2 


1 

2 


.X  I  o  ^2  1  -  y  ,  arcsiny 
M^;  =  /    y  ■  1  arccos  yay=y  arccos  y  —  1  


0 


TT 

4' 


X  =         y  =  —  ^  0.393. 


□ 


Thus 

2  8 

Exercises  9.6. 

1.  A  beam  10  meters  long  has  density  (j{x)  =      at  distance  x  from  tlie  left  end  of  the  beam. 
Find  the  center  of  mass  x.  =^ 

2.  A  beam  10  meters  long  has  density  cr^x)  =  sin(7rx/10)  at  distance  x  from  the  left  end  of  the 
beam.  Find  the  center  of  mass  x. 

3.  A  beam  4  meters  long  has  density  (t{x)  =  x^  at  distance  x  from  the  left  end  of  the  beam. 
Find  the  center  of  mass  x.  =^> 


^  , ,   .  p     ,       /"  „  ,         2  x\/l  —  x^      arcsin  x  _ 

4.  Veriiy  that  /  2x  arccos  xdx  =  x  arccos  x  ^  1  ^  h  G . 

5.  A  thin  plate  lies  in  the  region  between  y  =  x^  and  the  x-axis  between  x  =  1  and  x  =  2.  Find 
the  centroid.  =^> 

6.  A  thin  plate  fills  the  upper  half  of  the  unit  circle  x^  +  y'^  =  1.  Find  the  centroid. 

7.  A  thin  plate  lies  in  the  region  contained  hy  y  =  x  and  y  =  x^ .  Find  the  centroid.  =^ 

8.  A  thin  plate  lies  in  the  region  contained  by  y  =  4  —  a;'^  and  the  x-axis.  Find  the  centroid.  =^ 

1/3 

9.  A  thin  plate  lies  in  the  region  contained  by  y  =  a;  '    and  the  x-axis  between  x  =  0  and  x  =  1. 
Find  the  centroid.  =^> 

10.  A  thin  plate  lies  in  the  region  contained  by  ^/x  +  ^/y  =  1  and  the  axes  in  the  first  quadrant. 
Find  the  centroid.  =^> 

11.  A  thin  plate  lies  in  the  region  between  the  circle  x'^  +  y"^  =4  and  the  circle  x"^  +  y'^  =  1, 


above  the  x-axis.  Find  the  centroid. 

A  thin  plate  lies  in  the  region  betwe 
the  first  quadrant.  Find  the  centroid. 

A  thin  plate  lies  in  the  region  betwe 
above  the  x-axis.  Find  the  centroid. 


12.  A  thin  plate  lies  in  the  region  between  the  circle  x^  +  y'^  =  4  and  the  circle  x^  +  y'^  =  1  in 

13.  A  thin  plate  lies  in  the  region  between  the  circle  x^  +  y'^  =  25  and  the  circle  x'^  +  y'^  =  16 
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9.7    Kinetic  energy:  improper  integrals 


Recall  example  9.5.3  in  which  we  computed  the  work  required  to  lift  an  object  from  the 
surface  of  the  earth  to  some  large  distance  D  away.  Since  F  —  k/x^  we  computed 


We  noticed  that  as  D  increases,  k/ D  decreases  to  zero  so  that  the  amount  of  work  increases 
to  k/vQ.  More  precisely, 


We  might  reasonably  describe  this  calculation  as  computing  the  amount  of  work  required 
to  lift  the  object  "to  infinity,"  and  abbreviate  the  limit  as 


Such  an  integral,  with  a  limit  of  infinity,  is  called  an  improper  integral.  This  is  a  bit 
unfortunate,  since  it's  not  really  "improper"  to  do  this,  nor  is  it  really  "an  integral" — it  is 
an  abbreviation  for  the  limit  of  a  particular  sort  of  integral.  Nevertheless,  we're  stuck  with 
the  term,  and  the  operation  itself  is  perfectly  legitimate.  It  may  at  first  seem  odd  that  a 
finite  amount  of  work  is  sufficient  to  lift  an  object  to  "infinity" ,  but  sometimes  surprising 
things  are  nevertheless  true,  and  this  is  such  a  case.  If  the  value  of  an  improper  integral 
is  a  finite  number,  as  in  this  example,  we  say  that  the  integral  converges,  and  if  not  we 
say  that  the  integral  diverges. 

Here's  another  way,  perhaps  even  more  surprising,  to  interpret  this  calculation.  We 
know  that  one  interpretation  of 


is  the  area  under  y  =  from  a;  =  1  to  a; 

increases.  But  since 


D.  Of  course,  as  D  increases  this  area 


while  the  area  increases,  it  never  exceeds  1,  that  is 


The  area  of  the  infinite  region  under  y  =  l/x^  from  x  =  1  to  infinity  is  finite. 
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Consider  a  slightly  different  sort  of  improper  integral:    /     xe  ^  dx.  There  are  two 

J  —  oo 

ways  wc  might  try  to  compute  this.  First,  we  could  break  it  up  into  two  more  familiar 
integrals: 


r°°  _    2  _    2  /■  _  2 

/     xe  ^  dx  =        xe  ^  dx  +        xe  ^  dx. 

J —oo  J —oo  Jo 


Now  we  do  these  as  before: 


and 


0  -x^ 

-X^  J  T  6 

xe      dx  =  lim  

oo  2 


2  e 
xe~^  dx  —  lim  

D-)-oo  2 


D 


D 


1 

2' 


1 

2' 


so 


-x^  ^  1         1  n 

xe      dx  =  —-  +  -  —  0. 


Alternately,  we  might  try 

/oo  ^ 
xe~^  dx  =  lim  / 
-oo  D^ooJ_jj 


xe  ^  dx  =  lim  —- — 

D— >-oo  2 


D 


=  lim  h 

D— ^oo  2 


=  0. 


So  we  get  the  same  answer  either  way.  This  does  not  always  happen;  sometimes  the  second 
approach  gives  a  finite  number,  while  the  first  approach  does  not;  the  exercises  provide 

/oo 
f{x)  dx  according  to  the  first  method: 
-oo 

/a  foo 
f{x)  dx  and   /     f{x)  dx  must  converge  for  the  original  integral  to 
-oo  J  a 

converge.  The  second  approach  does  turn  out  to  be  useful;  when  lim    /     f{x)  dx  =  L, 

D^oo  J_^ 

/oo 
f{x)  dx. 
-oo 

Here's  a  more  concrete  application  of  these  ideas.  We  know  that  in  general 


l-Xl 

W=  Fdx 

Jxo 


is  the  work  done  against  the  force  F  in  moving  from  xq  to  xi.  In  the  case  that  F  is  the 
force  of  gravity  exerted  by  the  earth,  it  is  customary  to  make  F  <  0  since  the  force  is 
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"downward."  This  makes  the  work  W  negative  when  it  should  be  positive,  so  typically  the 
work  in  this  case  is  defined  as 


W  =  -  I  Fdx. 


-I 

Also,  by  Newton's  Law,  F  =  ma{t).  This  means  that 

rxi 

W  =  —        ma{t)  dx. 

J  Xo 

Unfortunately  this  integral  is  a  bit  problematic:  a{t)  is  in  terms  of  t,  while  the  limits  and 
the  "da;"  are  in  terms  of  x.  But  x  and  t  are  certainly  related  here:  x  =  x{t)  is  the  function 
that  gives  the  position  of  the  object  at  time  t,  so  v  =  v{t)  =  dx/dt  =  x'{t)  is  its  velocity 
and  a{t)  =  v'{t)  =  x"{t).  We  can  use  v  =  x'{t)  as  a  substitution  to  convert  the  integral 
from  "da;"  to  "dv"  in  the  usual  way,  with  a  bit  of  cleverness  along  the  way: 

dv  =  x"{t)  dt  =  ait)  dt  =  a(t)  ^  dx 

dx 

dx 

—  dv  =  a{t)  dx 
vdv  =  a{t)  dx. 

Substituting  in  the  integral: 

W  =  —        ma{t)  dx  =  —        mv  dv  = 

J  Xn  Jvn 


Vl  2  2 


Vo 


You  may  recall  seeing  the  expression  in  a  physics  course — it  is  called  the  kinetic 

energy  of  the  object.  We  have  shown  here  that  the  work  done  in  moving  the  object  from 
one  place  to  another  is  the  same  as  the  change  in  kinetic  energy. 

We  know  that  the  work  required  to  move  an  object  from  the  surface  of  the  earth  to 
infinity  is 

-f 


W=    r  ^dT=-. 

TO      ^  ^0 


At  the  surface  of  the  earth  the  acceleration  due  to  gravity  is  approximately  9.8  meters 
per  second  squared,  so  the  force  on  an  object  of  mass  m  is  F  =  9.8m.  The  radius  of  the 
earth  is  approximately  6378.1  kilometers  or  6378100  meters.  Since  the  force  due  to  gravity 
obeys  an  inverse  square  law,  F  =  k/r^  and  9.8m  =  A;/6378100^,  k  =  398665564178000m 
axidW  =  62505380m. 
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Now  suppose  that  the  initial  velocity  of  the  object,  vq,  is  just  enough  to  get  it  to 
infinity,  that  is,  just  enough  so  that  the  object  never  slows  to  a  stop,  but  so  that  its  speed 
decreases  to  zero,  i.e.,  so  that  vi  =  0.  Then 

mv?      mvn  mvn 

62505380m  =  W  =  ^  +  ^  =   ^ 

2  2  2 

so 

Vq  =  v^l25010760  ~  11181    meters  per  second, 

or  about  40251  kilometers  per  hour.  This  speed  is  called  the  escape  velocity.  Notice  that 
the  mass  of  the  object,  m,  canceled  out  at  the  last  step;  the  escape  velocity  is  the  same 
for  all  objects.  Of  course,  it  takes  considerably  more  energy  to  get  a  large  object  up  to 
40251  kph  than  a  small  one,  so  it  is  certainly  more  difficult  to  get  a  large  object  into  deep 
space  than  a  small  one.  Also,  note  that  while  we  have  computed  the  escape  velocity  for 
the  earth,  this  speed  would  not  in  fact  get  an  object  "to  infinity"  because  of  the  large  mass 
in  our  neighborhood  called  the  sun.  Escape  velocity  for  the  sun  starting  at  the  distance  of 
the  earth  from  the  sun  is  nearly  4  times  the  escape  velocity  we  have  calculated. 

Exercises  9.7. 

1.  Is  the  area  under  y  =  1/x  from  1  to  infinity  finite  or  infinite?  If  finite,  compute  the  area.  =^> 

2.  Is  the  area  under  y  =  from  1  to  infinity  finite  or  infinite?  If  finite,  compute  the  area. 


4.  Does  /     l/y/xdx  converge  or  diverge?  If  it  converges,  find  the  value. 


3. 

Does 

4. 

Does 

5. 

Does 

/■1/2 

i: 


oo 

—  X 


L 


and  determine  whether  it  converges  or  diverges;  if  it  converges,  find  the  value. 

7.  Does  /    1/^/xdx  converge  or  diverge?  If  it  converges,  find  the  value.  =^ 

^0 

8.  Does  /  sec  x  dx  converge  or  diverge?  If  it  converges,  find  the  value.  =^ 

Jo 

9.  Does  /  — ^  dx  converge  or  diverge?  If  it  converges,  find  the  value. 


4  +  3;*^ 

10.  Does  /      xdx  converge  or  diverge?  If  it  converges,  find  the  value.  Also  find  the  Cauchy 


oo 
oo 


Principal  Value,  if  it  exists. 

/oo 
■smxdx  converge  or  diverge?  If  it  converges,  find  the  value.  Also  find  the  Cauchy 
-  oo 

Principal  Value,  if  it  exists. 
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12.  Does  /  cosxdx  converge  or  diverge?  If  it  converges,  find  the  value.  Also  find  the  Cauchy 
Principal  Value,  if  it  exists.  =^ 

13.  Suppose  the  curve  y  =  1/x  is  rotated  around  the  x-axis  generating  a  sort  of  funnel  or  horn 
shape,  called  Gabriel's  horn  or  Toricelli's  trumpet.  Is  the  volume  of  this  funnel  from 
X  =  1  to  infinity  finite  or  infinite?  If  finite,  compute  the  volume. 

14.  An  officially  sanctioned  baseball  must  be  between  142  and  149  grams.  How  much  work, 
in  Newton-meters,  does  it  take  to  throw  a  ball  at  80  miles  per  hour?  At  90  mph?  At 
100.9  mph?  (According  to  the  Guinness  Book  of  World  Records,  at  http: //www. baseball- 
almanac.  com/recbooks/rb_guin.  shtml,  "The  greatest  reliably  recorded  speed  at  which  a 
baseball  has  been  pitched  is  100.9  mph  by  Lynn  Nolan  Ryan  (California  Angels)  at  Anaheim 
Stadium  in  California  on  August  20,  1974.") 


9.8  Probability 

You  perhaps  have  at  least  a  rudimentary  understanding  of  discrete  probability,  which 
measures  the  likelihood  of  an  "event"  when  there  are  a  finite  number  of  possibilities.  For 
example,  when  an  ordinary  six-sided  die  is  rolled,  the  probability  of  getting  any  particular 
number  is  1/6.  In  general,  the  probability  of  an  event  is  the  number  of  ways  the  event  can 
happen  divided  by  the  number  of  ways  that  "anything"  can  happen. 

For  a  slightly  more  complicated  example,  consider  the  case  of  two  six-sided  dice.  The 
dice  are  physically  distinct,  which  means  that  rolling  a  2-5  is  different  than  rolling  a  5-2; 
each  is  an  equally  likely  event  out  of  a  total  of  36  ways  the  dice  can  land,  so  each  has  a 
probability  of  1/36. 

Most  interesting  events  are  not  so  simple.  More  interesting  is  the  probability  of  rolling 
a  certain  sum  out  of  the  possibilities  2  through  12.  It  is  clearly  not  true  that  all  sums  are 
equally  likely:  the  only  way  to  roll  a  2  is  to  roll  1-1,  while  there  are  many  ways  to  roll  a  7. 
Because  the  number  of  possibilities  is  quite  small,  and  because  a  pattern  quickly  becomes 
evident,  it  is  easy  to  see  that  the  probabilities  of  the  various  sums  are: 


P(2)  = 

P(12) 

=  1/36 

P(3)  = 

P(ll) 

=  2/36 

P(4)  = 

P(10) 

=  3/36 

P(5)  = 

=  P(9) 

=  4/36 

P(6)  = 

=  P(8) 

=  5/36 

P(7) 

=  6/36 

Here  we  use  P{n)  to  mean  "the  probability  of  rolling  an  n."  Since  we  have  correctly 
accounted  for  all  possibilities,  the  sum  of  all  these  probabilities  is  36/36  =  1;  the  probability 
that  the  sum  is  one  of  2  through  12  is  1,  because  there  are  no  other  possibilities. 
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The  study  of  probability  is  concerned  with  more  difficult  questions  as  well;  for  example, 
suppose  the  two  dice  are  rolled  many  times.  On  the  average,  what  sum  will  come  up?  In 
the  language  of  probability,  this  average  is  called  the  expected  value  of  the  sum.  This  is 
at  first  a  little  misleading,  as  it  does  not  tell  us  what  to  "expect"  when  the  two  dice  are 
rolled,  but  what  we  expect  the  long  term  average  will  be. 

Suppose  that  two  dice  are  rolled  36  million  times.  Based  on  the  probabilities,  we  would 
expect  about  1  million  rolls  to  be  2,  about  2  million  to  be  3,  and  so  on,  with  a  roll  of  7 
topping  the  list  at  about  6  million.  The  sum  of  all  rolls  would  be  1  million  times  2  plus  2 
million  times  3,  and  so  on,  and  dividing  by  36  million  we  would  get  the  average: 

X  =  (2  •  10^  +  3(2  •  10^)  +  •  •  •  +  7(6  •  10^)  +  •  •  •  +  12  •  10^)—^ 

36  •  lU" 

^10^        .2-106  „  6-106  10^ 

36-106  ^   36-106^  36-106^  36  - 10^ 

=  2P(2)  +  3P(3)  +  -  -  -  +  7P(7)  +  -  -  -  +  12P(12) 

=  f;zP(z)  =  7. 

There  is  nothing  special  about  the  36  million  in  this  calculation.  No  matter  what  the 

12 

number  of  rolls,  once  we  simplify  the  average,  we  get  the  same       iP{i).  While  the  actual 

i=2 

average  value  of  a  large  number  of  rolls  will  not  be  exactly  7,  the  average  should  be  close 
to  7  when  the  number  of  rolls  is  large.  Turning  this  around,  if  the  average  is  not  close  to 
7,  we  should  suspect  that  the  dice  are  not  fair. 

A  variable,  say  X,  that  can  take  certain  values,  each  with  a  corresponding  probability, 
is  called  a  random  variable;  in  the  example  above,  the  random  variable  was  the  sum  of 
the  two  dice.  If  the  possible  values  for  X  are  •  •  i-^m 

then  the  expected  value  of  the 

n 

random  variable  is  E{X)  =  ^^^iPi^i)-  The  expected  value  is  also  called  the  mean. 

i=l 

When  the  number  of  possible  values  for  X  is  finite,  we  say  that  X  is  a  discrete  random 
variable.  In  many  applications  of  probability,  the  number  of  possible  values  of  a  random 
variable  is  very  large,  perhaps  even  infinite.  To  deal  with  the  infinite  case  we  need  a 
different  approach,  and  since  there  is  a  sum  involved,  it  should  not  be  wholly  surprising 
that  integration  turns  out  to  be  a  useful  tool.  It  then  turns  out  that  even  when  the 
number  of  possibilities  is  large  but  finite,  it  is  frequently  easier  to  pretend  that  the  number 
is  infinite.  Suppose,  for  example,  that  a  dart  is  thrown  at  a  dart  board.  Since  the  dart 
board  consists  of  a  finite  number  of  atoms,  there  are  in  some  sense  only  a  finite  number 
of  places  for  the  dart  to  land,  but  it  is  easier  to  explore  the  probabilities  involved  by 
pretending  that  the  dart  can  land  on  any  point  in  the  usual  x-y  plane. 
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DEFINITION  9.8.1     Let  /  :  M  ^  M  be  a  function.   If  f{x)  >  0  for  every  x  and 

/CX3 
f{x)  dx  —  1  then  /  is  a  probability  density  function.  □ 
-oo 

We  associate  a  probability  density  function  with  a  random  variable  X  by  stipulating 
that  the  probability  that  X  is  between  a  and  b  is       f{x)  dx.  Because  of  the  requirement 

J  a 

that  the  integral  from  —  oo  to  oo  be  1,  all  probabilities  are  less  than  or  equal  to  1,  and  the 
probability  that  X  takes  on  some  value  between  — oo  and  oo  is  1,  as  it  should  be. 

EXAMPLE  9.8.2  Consider  again  the  two  dice  example;  we  can  view  it  in  a  way  that 
more  resembles  the  probability  density  function  approach.  Consider  a  random  variable  X 
that  takes  on  any  real  value  with  probabilities  given  by  the  probability  density  function  in 
figure  9.8.1.  The  function  /  consists  of  just  the  top  edges  of  the  rectangles,  with  vertical 
sides  drawn  for  clarity;  the  function  is  zero  below  1.5  and  above  12.5.  The  area  of  each 
rectangle  is  the  probability  of  rolling  the  sum  in  the  middle  of  the  bottom  of  the  rectangle, 


or 


P{n)  = 

The  probability  of  rolling  a  4,  5,  or  6  is 


1+1/2 


f{x)  dx. 


n-l/2 


Pin) 


13/2 


/(x)  dx. 


7/2 


Of  course,  we  could  also  compute  probabilities  that  don't  make  sense  in  the  context  of  the 
dice,  such  as  the  probability  that  X  is  between  4  and  5.8.  □ 


6 
5 
4 
3 
2 
1 


123456789       10      11  12 
Figure  9.8.1     A  probability  density  function  for  two  dice. 


4 


6 


7 
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The  function 


F{x)  =  P{X<x)=  I  f{t)dt 
is  called  the  cumulative  distribution  function  or  simply  (probability)  distribution. 


EXAMPLE  9.8.3    Suppose  that  a  <  6  and 

1 


if  a  <  a;  <  6 


fix)  =  \  b-a 

0  otherwise. 

Then  f{x)  is  the  uniform  probability  density  function  on  [a,  b].  and  the  corresponding 
distribution  is  the  uniform  distribution  on  [a,  6].  n 

EXAMPLE  9.8.4    Consider  the  function  f{x)  =  What  can  we  say  about 

/oo 
-oo 

We  cannot  find  an  antiderivative  of  /,  but  wc  can  see  that  this  integral  is  some  finite 
number.  Notice  that  0  <  f{x)  =  <  e~^^^  for  |a;|  >  1.  This  implies  that  the  area 

under  is  less  than  the  area  under  e~^/^,  over  the  interval  [l,oo).   It  is  easy  to 

compute  the  latter  area,  namely 

/  e--/Ux=^, 
Ji  Ve 

so 

/■oo 

e--'/^dx 


L 


is  some  finite  number  smaller  than  Because  /  is  symmetric  around  the  j/-axis, 

- 1  /•oo 


1 


This  means  that 


e-^'/2da;=  /     e-'''/^dx+  /    e-'='/^dx+  /     e-''^/^dx  =  A 

-oo  J —  oo  J —  1  Jl 

for  some  finite  positive  number  A.  Now  if  we  let  g{x)  =  f{x)/A, 


/oo  poo  ^  2 

g{x)dx^—  e'"^^  1"^  dx= —A=\, 
-oo  ^  J -oo  ^ 


so  5r  is  a  probability  density  function.  It  turns  out  to  be  very  useful,  and  is  called  the 
standard  normal  probability  density  function  or  more  informally  the  bell  curve. 
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0.5  n 


-4-3-2-10       1       2       3  4 
Figure  9.8.2     The  bell  curve. 

giving  rise  to  the  standard  normal  distribution.  See  figure  9.8.2  for  the  graph  of  the 
bell  curve.  □ 

We  have  shown  that  A  is  some  finite  number  without  computing  it;  we  cannot  compute 
it  with  the  techniques  we  have  available.  By  using  some  techniques  from  multivariable 
calculus,  it  can  be  shown  that  A  =  V2n. 


EXAMPLE  9.8.5    The  exponential  distribution  has  probability  density  function 

where  c  is  a  positive  constant.  □ 

The  mean  or  expected  value  of  a  random  variable  is  quite  useful,  as  hinted  at  in 
our  discussion  of  dice.  Recall  that  the  mean  for  a  discrete  random  variable  is  E{X)  = 

n 

^^XiP(xi).  In  the  more  general  context  we  use  an  integral  in  place  of  the  sum. 

1=1 


DEFINITION  9.8.6    The  mean  of  a  random  variable  X  with  probability  density  func- 

□ 


/oo 
xf{x)  dx,  provided  the  integral  converges. 
-  oo 


When  the  mean  exists  it  is  unique,  since  it  is  the  result  of  an  explicit  calculation.  The 
mean  does  not  always  exist. 

The  mean  might  look  familiar;  it  is  essentially  identical  to  the  center  of  mass  of  a  one- 
dimensional  beam,  as  discussed  in  section  9.6.  The  probability  density  function  /  plays 
the  role  of  the  physical  density  function,  but  now  the  "beam"  has  infinite  length.  If  we 
consider  only  a  finite  portion  of  the  beam,  say  between  a  and  b,  then  the  center  of  mass  is 

f' 

/   xf{x)  dx 

—  .In 

X  = 


b 

f{x)  dx 
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If  we  extend  the  beam  to  infinity,  we  get 

/oo 
xf{x)  dx 
-oo 


/oo 
xf{x)dx  =  E{X), 

I  J\x)ax 

J —oo 


/oo 
f{x)dx  =  1.  In  the  center  of  mass  interpretation,  this  integral  is  the  total 
-oo 

mass  of  the  beam,  which  is  always  1  when  /  is  a  probability  density  function. 
EXAMPLE  9.8.7    The  mean  of  the  standard  normal  distribution  is 

/•oo  g-xV2 

/     X — dx. 
J-oo  v27r 

We  compute  the  two  halves: 

X  — dx  =  lim 


27T  D^-oo  W2tt 


and 


f 

Jo 


D 


oo    g-xV2  e""^""/^  ^ 

X — dx  =  lim 


2tt 


The  sum  of  these  is  0,  which  is  the  mean.  □ 

While  the  mean  is  very  useful,  it  typically  is  not  enough  information  to  properly 
evaluate  a  situation.  For  example,  suppose  we  could  manufacture  an  11-sided  die,  with 
the  faces  numbered  2  through  12  so  that  each  face  is  equally  likely  to  be  down  when  the  die 
is  rolled.  The  value  of  a  roll  is  the  value  on  this  lower  face.  Rolling  the  die  gives  the  same 
range  of  values  as  rolling  two  ordinary  dice,  but  now  each  value  occurs  with  probability 
1/11.  The  expected  value  of  a  roll  is 

2       3  12  ^ 

n  +  n+---+ii=^- 

The  mean  does  not  distinguish  the  two  cases,  though  of  course  they  are  quite  different. 

If  /  is  a  probability  density  function  for  a  random  variable  X,  with  mean  we  would 
like  to  measure  how  far  a  "typical"  value  of  X  is  from  fj,.  One  way  to  measure  this  distance 
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is  {X  —  jif-^  we  square  the  difference  so  as  to  measure  all  distances  as  positive.  To  get  the 
typical  such  squared  distance,  we  compute  the  mean.  For  two  dice,  for  example,  we  get 

Because  we  squared  the  differences  this  does  not  directly  measure  the  typical  distance  we 
seek;  if  we  take  the  square  root  of  this  we  do  get  such  a  measure,  \/35/36  ~  2.42.  Doing 
the  computation  for  the  strange  11-sided  die  we  get 

(2-7)^1 +  (3- 7)^^  + •••  +  (7- 7)^^  + •••(11 -7)^^ +  (12-7)^1  =  10, 

with  square  root  approximately  3.16.  Comparing  2.42  to  3.16  tells  us  that  the  two-dice 
rolls  clump  somewhat  more  closely  near  7  than  the  rolls  of  the  weird  die,  which  of  course 
we  already  knew  because  these  examples  are  quite  simple. 

To  perform  the  same  computation  for  a  probability  density  function  the  siim  is  replaced 
by  an  integral,  just  as  in  the  computation  of  the  mean.  The  expected  value  of  the  squared 
distances  is 


/oo 
-oo 


-oo 

called  the  variance.  The  square  root  of  the  variance  is  the  standard  deviation,  denoted 
a. 

EXAMPLE  9.8.8  We  compute  the  standard  deviation  of  the  standard  normal  distru- 
bution.  The  variance  is 


27r  J-oo 


To  compute  the  antiderivative,  use  integration  by  parts,  with  u  =  x  and  dv  =  xe~^  '  dx. 
This  gives 

j  x^e-''"/^dx=-xe-''"/^  +  J  e-^'/^dx. 

We  cannot  do  the  new  integral,  but  we  know  its  value  when  the  limits  are  — oo  to  oo,  from 
our  discussion  of  the  standard  normal  distribution.  Thus 

x^e-^"/Ux=  -^xe-^"/^       +^  e-^"/'dx^0+^y/2^=l. 
v27r  J-oo  v27r  _^     v27r  J-oo  y27r 

The  standard  deviation  is  then  a/I  =  1.  □ 

Here  is  a  simple  example  showing  how  these  ideas  can  be  useful.  Suppose  it  is  known 
that,  in  the  long  run,  1  out  of  every  100  computer  memory  chips  produced  by  a  certain 
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manufacturing  plant  is  defective  when  the  manufacturing  process  is  running  correctly. 
Suppose  1000  chips  are  selected  at  random  and  15  of  them  are  defective.  This  is  more 
than  the  'expected'  number  (10),  but  is  it  so  many  that  we  should  suspect  that  something 
has  gone  wrong  in  the  manufacturing  process?  We  are  interested  in  the  probability  that 
various  numbers  of  defective  chips  arise;  the  probability  distribution  is  discrete:  there 
can  only  be  a  whole  number  of  defective  chips.  But  (under  reasonable  assumptions)  the 
distribution  is  very  close  to  a  normal  distribution,  namely  this  one: 


fix) 


1 


exp 


(x-  10)^ 


v^Vl000(.01)(.99)  V2(1000)(.01)(.99) 
which  is  pictured  in  figure  9.8.3  (recall  that  exp(a:)  =  e^). 


0  5  10         15         20  25 

Figure  9.8.3     Normal  density  function  for  the  defective  chips  example. 


Now  how  do  we  measure  how  unlikely  it  is  that  under  normal  circumstances  we  would 
see  15  defective  chips?  We  can't  compute  the  probability  of  exactly  15  defective  chips,  as 

/.15  /.15.5 

this  would  be  /     /(x)  (ix  =  0.  We  could  compute  /       /(x)  (ix  ~  0.036;  this  means  there 

Jl5  il4.5 

is  only  a  3.6%  chance  that  the  number  of  defective  chips  is  15.  (We  cannot  compute  these 
integrals  exactly;  computer  software  has  been  used  to  approximate  the  integral  values  in 

^•10. 5 

this  discussion.)  But  this  is  misleading:    /       f{x)dx  ~  0.126,  which  is  larger,  certainly, 

^9.5 

but  still  small,  even  for  the  "most  likely"  outcome.  The  most  useful  question,  in  most 
circumstances,  is  this:  how  likely  is  it  that  the  number  of  defective  chips  is  "far  from" 
the  mean?  For  example,  how  likely,  or  unlikely,  is  it  that  the  number  of  defective  chips 
is  different  by  5  or  more  from  the  expected  value  of  10?  This  is  the  probability  that  the 
number  of  defective  chips  is  less  than  5  or  larger  than  15,  namely 

f{x)dx+  /     f{x)dx^  0.11. 

-oo  Jl5 

So  there  is  an  11%  chance  that  this  happens — not  large,  but  not  tiny.  Hence  the  15 
defective  chips  does  not  appear  to  be  cause  for  alarm:  about  one  time  in  nine  we  would 
expect  to  see  the  number  of  defective  chips  5  or  more  away  from  the  expected  10.  How 
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about  20?  Here  we  compute 

0  poo 

f{x)dx+  /  ^  0.0015. 

So  there  is  only  a  0.15%  chance  that  the  number  of  defective  chips  is  more  than  10  away 
from  the  mean;  this  would  typically  be  interpreted  as  too  suspicious  to  ignore — it  shouldn't 
happen  if  the  process  is  running  normally. 

The  big  question,  of  course,  is  what  level  of  improbability  should  trigger  concern? 
It  depends  to  some  degree  on  the  application,  and  in  particular  on  the  consequences  of 
getting  it  wrong  in  one  direction  or  the  other.  If  we're  wrong,  do  we  lose  a  little  money? 
A  lot  of  money?  Do  people  die?  In  general,  the  standard  choices  are  5%  and  1%.  So  what 
we  should  do  is  find  the  number  of  defective  chips  that  has  only,  let  us  say,  a  1%  chance 
of  occurring  under  normal  circumstances,  and  use  that  as  the  relevant  number.  In  other 
words,  we  want  to  know  when 


/lO  —  r  /"CXD 
f{x)dx+  /      f{x)dx  <  0.01. 
-00  JlO+r 


A  bit  of  trial  and  error  shows  that  with  r  =  8  the  value  is  about  0.011,  and  with  r  =  9  it 
is  about  0.004,  so  if  the  number  of  defective  chips  is  19  or  more,  or  1  or  fewer,  we  should 
look  for  problems.  If  the  number  is  high,  we  worry  that  the  manufacturing  process  has 
a  problem,  or  conceivably  that  the  process  that  tests  for  defective  chips  is  not  working 
correctly  and  is  flagging  good  chips  as  defective.  If  the  number  is  too  low,  we  suspect  that 
the  testing  procedure  is  broken,  and  is  not  detecting  defective  chips. 

Exercises  9.8. 


/oo 
e^^^'^  dx  =  2/\/e- 


2.  Show  that  the  function  in  example  9.8.5  is  a  probabihty  density  function.  Compute  the  mean 
and  standard  deviation. 

3.  Compute  the  mean  and  standard  deviation  of  the  uniform  distribution  on  [a,b].  (See  exam- 
ple 9.8.3.)  ^ 

4.  What  is  the  expected  value  of  one  roll  of  a  fair  six-sided  die?  =^> 

5.  What  is  the  expected  sum  of  one  roh  of  three  fair  six-sided  dice?  =^ 

6.  Let  fi  and  a  be  real  numbers  with  a  >  0.  Show  that 

N{X)  =    e  2a2 


/27rf7 

is  a  probability  density  function.  You  will  not  be  able  to  compute  this  integral  directly;  use 
a  substitution  to  convert  the  integral  into  the  one  from  example  9.8.4.  The  function  A''  is 
the  probability  density  function  of  the  normal  distribution  with  mean  fi  and  standard 
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deviation  a.  Show  that  the  mean  of  the  normal  distribution  is  fi  and  the  standard  deviation 
is  a. 

7.  Let 

f  4  ixi  >  1 


0      Ixl  <  1 


Show  that  /  is  a  probabihty  density  function,  and  that  the  distribution  has  no  mean. 

8.  Let 

{X     —1  <  X  <  1 
1  l<x<2 
0  otherwise. 

/CXD 
f{x)  dx  =  1.  Is  /  a  probabihty  density  function?  Justify  your  answer. 
-  CXD 

9.  If  you  have  access  to  appropriate  software,  find  r  so  that 


fix)  dx+  fix)  dx  «  0.05. 

-oo  JlO+r 


'10+r 

Discuss  the  impact  of  using  this  new  value  of  r  to  decide  whether  to  investigate  the  chip 
manufacturing  process.  =J> 


9.9    Arc  Length 

Here  is  another  geometric  application  of  the  integral:  find  the  length  of  a  portion  of  a 
curve.  As  usual,  we  need  to  think  about  how  we  might  approximate  the  length,  and  turn 
the  approximation  into  an  integral. 

We  already  know  how  to  compute  one  simple  arc  length,  that  of  a  line  segment.  If  the 
endpoints  are  PoixQ,yo)  and  Piixi,yi)  then  the  length  of  the  segment  is  the  distance  be- 
tween the  points,  ■>/ (xi  —  xq)^  +  iyi  —  yo)"^,  from  the  Pythagorean  theorem,  as  illustrated 
in  figure  9.9.1. 


Figure  9.9.1     The  length  of  a  line  segment. 


Now  if  the  graph  of  /  is  "nice"  (say,  difFerentiable)  it  appears  that  we  can  approximate 
the  length  of  a  portion  of  the  curve  with  line  segments,  and  that  as  the  number  of  segments 
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Figure  9.9.2     Approximating  arc  length  with  hne  segments. 

increases,  and  their  lengths  decrease,  the  sum  of  the  lengths  of  the  line  segments  will 
approach  the  true  arc  length;  see  figure  9.9.2. 

Now  we  need  to  write  a  formula  for  the  sum  of  the  lengths  of  the  line  segments,  in  a 
form  that  we  know  becomes  an  integral  in  the  limit.  So  we  suppose  we  have  divided  the 
interval  [a,  b]  into  n  subintervals  as  usual,  each  with  length  Ax  =  {b  —  a)/n,  and  endpoints 
a  =  xo,  xi,  X2,  Xn  =  b.  The  length  of  a  typical  line  segment,  joining  {xi,f{xi)) 
to  {xi+ij{xi+i)),  is  v^(Ax)2  +  (/(xi+i)  -  fixi))^.  By  the  Mean  Value  Theorem  (6.5.2), 
there  is  a  number  ti  in  {xi,Xi+i)  such  that  f'{ti)Ax  =  f{xi+i)  —  f{xi),  so  the  length  of 
the  line  segment  can  be  written  as 

v/(Ax)2  +  (/'(t,))2Ax2  =  ^l  +  {f'{U))^Ax. 

The  arc  length  is  then 

n-l  „6 

lim  VvTTCmFAx=  /   Vl  +  (/'(x))2rfx. 

Note  that  the  sum  looks  a  bit  different  than  others  we  have  encountered,  because  the 
approximation  contains  a  ti  instead  of  an  Xi.  In  the  past  we  have  always  used  left  endpoints 
(namely,  Xi)  to  get  a  representative  value  of  /  on  [xi,Xj-(-i];  now  we  are  using  a  different 
point,  but  the  principle  is  the  same. 

To  summarize,  to  compute  the  length  of  a  curve  on  the  interval  [a,  b] ,  we  compute  the 
integral 

b 

^/l  +  {f'{x))^dx. 

Unfortunately,  integrals  of  this  form  are  typically  difficult  or  impossible  to  compute  exactly, 
because  usually  none  of  our  methods  for  finding  antiderivatives  will  work.  In  practice  this 
means  that  the  integral  will  usually  have  to  be  approximated. 
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EXAMPLE  9.9.1  Let  f{x)  =  v  —  x^,  the  upper  half  circle  of  radius  r.  The  length 
of  this  curve  is  half  the  circumference,  namely  nr.  Let's  compute  this  with  the  arc  length 


formula.  The  derivative  /'  is  —x/ y     —      so  the  integral  is 


r   '  '  J —r   '    '  J —r 


— ^dx=  [   \  — Tjdx^r  [  a/^t" — odx. 


Using  a  trigonometric  substitution,  we  find  the  antiderivative,  namely  arcsin(x/r).  Notice 


that  the  integral  is  improper  at  both  endpoints,  as  the  function  y  l/(r2  —  x^)  is  undefined 
when  X  =  ±r.  So  we  need  to  compute 


0     I  ^   rD 


lim     /    \l  — 7^dx+  lim    /    J      — dx. 

D^-r+  J j^)    \j        —  X'^  D^r-  Jq      V  ^  3^ 

This  is  not  difficult,  and  has  value  tt,  so  the  original  integral,  with  the  extra  r  in  front,  has 
value  irr  as  expected.  □ 

Exercises  9.9. 

1.  Find  the  arc  length  of  f{x)  =  x^^^  on  [0,2].  =^> 

2.  Find  the  arc  length  of  f{x)  =         —  Inx  on  [1,  2]. 

3.  Find  the  arc  length  of  f{x)  =  (l/3)(a;^  +  2)^^"^  on  the  interval  [0,a]. 

4.  Find  the  arc  length  of  f{x)  =  ln(sina;)  on  the  interval  [7r/4,7r/3]. 

/■a 

5.  Let  a  >  0.  Show  that  the  length  of  y  =  cosh  a;  on  [0,  o]  is  equal  to  /  coshxdx. 

Jo 

6.  Find  the  arc  length  of  f{x)  =  cosh  a;  on  [0,ln2]. 

7.  Set  up  the  integral  to  find  the  arc  length  of  sin  a;  on  the  interval  [0,7r];  do  not  evaluate  the 
integral.  If  you  have  access  to  appropriate  software,  approximate  the  value  of  the  integral. 

=^ 

8.  Set  up  the  integral  to  find  the  arc  length  of  y  =  xe^^  on  the  interval  [2,  3];  do  not  evaluate  the 
integral.  If  you  have  access  to  appropriate  software,  approximate  the  value  of  the  integral. 

=^ 

9.  Find  the  arc  length  of  y  =       on  the  interval  [0, 1].   (This  can  be  done  exactly;  it  is  a  bit 
tricky  and  a  bit  long.) 


9.10    Surface  Area 

Another  geometric  question  that  arises  naturally  is:  "What  is  the  surface  area  of  a  vol- 
ume?" For  example,  what  is  the  surface  area  of  a  sphere?  More  advanced  techniques 
are  required  to  approach  this  question  in  general,  but  we  can  compute  the  areas  of  some 
volumes  generated  by  revolution. 
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As  usual,  the  question  is:  how  might  we  approximate  the  surface  area?  For  a  surface 
obtained  by  rotating  a  curve  around  an  axis,  we  can  take  a  polygonal  approximation  to 
the  curve,  as  in  the  last  section,  and  rotate  it  around  the  same  axis.  This  gives  a  surface 
composed  of  many  "truncated  cones;"  a  truncated  cone  is  called  a  frustum  of  a  cone. 
Figure  9.10.1  illustrates  this  approximation. 


Figure  9.10.1     Approximating  a  surface  (left)  by  portions  of  cones  (right). 

So  we  need  to  be  able  to  compute  the  area  of  a  frustum  of  a  cone.  Since  the  frustum 
can  be  formed  by  removing  a  small  cone  from  the  top  of  a  larger  one,  we  can  compute 
the  desired  area  if  we  know  the  surface  area  of  a  cone.  Suppose  a  right  circular  cone  has 
base  radius  r  and  slant  height  h.  If  we  cut  the  cone  from  the  vertex  to  the  base  circle 
and  flatten  it  out,  we  obtain  a  sector  of  a  circle  with  radius  h  and  arc  length  27rr,  as  in 
figure  9.10.2.  The  angle  at  the  center,  in  radians,  is  then  27rr//i,  and  the  area  of  the  cone 
is  equal  to  the  area  of  the  sector  of  the  circle.  Let  A  be  the  area  of  the  sector;  since  the 
area  of  the  entire  circle  is  tt/i^,  we  have 

A   _  2'Kr/h 
nh'^  27r 
A  =  nrh. 

Now  suppose  we  have  a  frustum  of  a  cone  with  slant  height  h  and  radii  tq  and  ri,  as 
in  figure  9.10.3.  The  area  of  the  entire  cone  is  Tcri{ho  +  h),  and  the  area  of  the  small  cone 
is  irroho;  thus,  the  area  of  the  frustum  is  7rri(/io  +  h)  —  irroho  =  7r((ri  —  ro)ho  +  rih).  By 
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Figure  9.10.2     The  area  of  a  cone. 

similar  triangles, 

^0  _  ^0  +  ^ 

ro  ri 

With  a  bit  of  algebra  this  becomes  (ri  —  ro)ho  =  roh;  substitution  into  the  area  gives 

Tq  +  Ti 

7r((ri  —  ro)/io  +  ^i^)  =  ^{roh  +  rih)  =  7rh(ro  +  ri)  =  27i —  h  =  2'Krh. 

The  final  form  is  particularly  easy  to  remember,  with  r  equal  to  the  average  of  ro  and  ri, 
as  it  is  also  the  formula  for  the  area  of  a  cylinder.  (Think  of  a  cylinder  of  radius  r  and 
height  h  as  the  frustum  of  a  cone  of  infinite  height.) 


Figure  9.10.3     The  area  of  a  frustum. 


Now  we  are  ready  to  approximate  the  area  of  a  surface  of  revolution.  On  one  subin- 
terval,  the  situation  is  as  shown  in  figure  9.10.4.  When  the  line  joining  two  points  on  the 
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curve  is  rotated  around  the  a;-axis,  it  forms  a  frustum  of  a  cone.  The  area  is 


Here  yl  +  (/'(ti))^  Ax  is  the  length  of  the  hne  segment,  as  we  found  in  the  previous 
section.  Assuming  /  is  a  continuous  function,  there  must  be  some  x*  in  [a;j,a;j+i]  such 
that  {f{xi)  +  f{xi+i))/2  =  f{x*),  so  the  approximation  for  the  surface  area  is 


n-l 


5^27r/(<)VTTCmFAx. 


1=0 


This  is  not  quite  the  sort  of  sum  we  have  seen  before,  as  it  contains  two  different  values 
in  the  interval  namely  x*  and  ti.  Nevertheless,  using  more  advanced  techniques 

than  we  have  available  here,  it  turns  out  that 

hm  J2'^7Tf{x*)Vl  +  {f'{ti)rAx=  /  27Tf{xWl  +  if'{x))^dx 

is  the  surface  area  we  seek.  (Roughly  speaking,  this  is  because  while  x\  and  ti  are  distinct 
values  in  [a;i,a;i+i],  they  get  closer  and  closer  to  each  other  as  the  length  of  the  interval 
shrinks.) 


I 


Figure  9.10.4     One  subinterval. 


EXAMPLE  9.10.1  We  compute  the  surface  area  of  a  sphere  of  radius  r.  The  sphere  can 
be  obtained  by  rotating  the  graph  of  j{x)  —  \/ r"^  —  x^  about  the  a;-axis.  The  derivative 
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/'  is  —  x/  V     —  x^,  so  the  surface  area  is  given  by 


□ 

If  the  curve  is  rotated  around  the  y  axis,  the  formula  is  nearly  identical,  because  the 
length  of  the  line  segment  we  use  to  approximate  a  portion  of  the  curve  doesn't  change. 
Instead  of  the  radius  f{x*),  we  use  the  new  radius  Xi  =  {xi  +  Xi+i)/2,  and  the  surface  area 
integral  becomes 


27rxVl  +  {f'{x))'^dx. 


EXAMPLE  9.10.2     Compute  the  area  of  the  surface  formed  when  f{x)  =  x^  between 
0  and  2  is  rotated  around  the  y-axis. 
We  compute  /'(x)  =  2x,  and  then 

271  [  xVl  +  4x2rfx  =  -(17^/2  _  lY 
Jo  6 

by  a  simple  substitution.  □ 
Exercises  9.10. 

1.  Compute  the  area  of  the  surface  formed  when  f{x)  =  2\/\  —  x  between  —1  and  0  is  rotated 
around  the  x-axis.  =^ 

2.  Compute  the  surface  area  of  example  9.10.2  by  rotating  f{x)  =  \fx  around  the  x-axis. 

3.  Compute  the  area  of  the  surface  formed  when  fix)  =  x^  between  1  and  3  is  rotated  around 
the  X-axis. 

4.  Compute  the  area  of  the  surface  formed  when  /(x)  =  2  +  cosh(x)  between  0  and  1  is  rotated 
around  the  x-axis.  =^ 

5.  Consider  the  surface  obtained  by  rotating  the  graph  of  /(x)  =  1/x,  x  >  1,  around  the  x-axis. 
This  surface  is  called  Gabriel's  horn  or  Toricelli's  trumpet.  In  exercise  13  in  section  9.7 
we  saw  that  Gabriel's  horn  has  finite  volume.  Show  that  Gabriel's  horn  has  infinite  surface 
area. 

6.  Consider  the  circle  (x  —  +  =  1.  Sketch  the  surface  obtained  by  rotating  this  circle 
about  the  y-axis.  (The  surface  is  called  a  torus.)  What  is  the  surface  area?  =^ 
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7.  Consider  the  ellipse  with  equation  / A  +  -y^  =  1.  If  the  ellipse  is  rotated  around  the  a;-axis 
it  forms  an  ellipsoid.  Compute  the  surface  area. 

8.  Generalize  the  preceding  result:  rotate  the  ellipse  given  by  x' jc?  +  y' j^?  =  1  about  the 
X-axis  and  find  the  surface  area  of  the  resulting  ellipsoid.  You  should  consider  two  cases, 
when  a  >  b  and  when  a  <  b.  Compare  to  the  area  of  a  sphere.  =^ 


10 

Polar  Coordinates, 
Parametric  Equations 


10.1    Polar  Coordinates 

Coordinate  systems  are  tools  that  let  us  use  algebraic  methods  to  understand  geometry. 
While  the  rectangular  (also  called  Cartesian)  coordinates  that  we  have  been  using  are 
the  most  common,  some  problems  are  easier  to  analyze  in  alternate  coordinate  systems. 

A  coordinate  system  is  a  scheme  that  allows  us  to  identify  any  point  in  the  plane  or 
in  three-dimensional  space  by  a  set  of  numbers.  In  rectangular  coordinates  these  numbers 
are  interpreted,  roughly  speaking,  as  the  lengths  of  the  sides  of  a  rectangle.  In  polar 
coordinates  a  point  in  the  plane  is  identified  by  a  pair  of  numbers  (r,  9).  The  number  9 
measures  the  angle  between  the  positive  x-axis  and  a  ray  that  goes  through  the  point,  as 
shown  in  figure  10.1.1;  the  number  r  measures  the  distance  from  the  origin  to  the  point. 
Figure  10.1.1  shows  the  point  with  rectangular  coordinates  (1,  a/S)  and  polar  coordinates 
(2,  vr/3),  2  units  from  the  origin  and  7r/3  radians  from  the  positive  x-axis. 


1 


Figure  10.1.1     Polar  coordinates  of  the  point  (1,  V^). 
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Just  as  we  describe  curves  in  the  plane  using  equations  involving  x  and  y,  so  can  we 
describe  curves  using  equations  involving  r  and  9.  Most  common  are  equations  of  the  form 

r  =  f{e). 

EXAMPLE  10.1.1     Graph  the  curve  given  by  r  =  2.   All  points  with  r  =  2  are  at 

distance  2  from  the  origin,  so  r  =  2  describes  the  circle  of  radius  2  with  center  at  the 

origin.  □ 

EXAMPLE  10.1.2  Graph  the  curve  given  by  r  =  1  +  cos^.  We  first  consider  y  = 
1  +  cosx,  as  in  figure  10.1.2.  As  9  goes  through  the  values  in  [0,  27r],  the  value  of  r  tracks 
the  value  of  y,  forming  the  "cardioid"  shape  of  figure  10.1.2.  For  example,  when  9  =  n/2, 
r  =  1  +  cos(7r/2)  =  1,  so  we  graph  the  point  at  distance  1  from  the  origin  along  the 
positive  y-axis,  which  is  at  an  angle  of  7r/2  from  the  positive  x-axis.  When  9  =  77r/4, 
r  =  1  +  cos(77r/4)  =  1  +  V2/2  ^  1.71,  and  the  corresponding  point  appears  in  the  fourth 
quadrant.  This  illustrates  one  of  the  potential  benefits  of  using  polar  coordinates:  the 
equation  for  this  curve  in  rectangular  coordinates  would  be  quite  complicated.  □ 


Figure  10.1.2     A  cardioid:  y  =  1  +  cosx  on  the  left,  r  =  1  +  cos9  on  the  right. 

Each  point  in  the  plane  is  associated  with  exactly  one  pair  of  numbers  in  the  rect- 
angular coordinate  system;  each  point  is  associated  with  an  infinite  number  of  pairs  in 
polar  coordinates.  In  the  cardioid  example,  we  considered  only  the  range  0  <  ^  <  27r, 
and  already  there  was  a  duplicate:  (2,  0)  and  (2,  2%)  are  the  same  point.  Indeed,  every 
value  of  9  outside  the  interval  [0,  2%)  duplicates  a  point  on  the  curve  r  =  1  +  cos  9  when 
0  <  ^  <  2tt.  We  can  even  make  sense  of  polar  coordinates  like  (—2,  n/4):  go  to  the  direc- 
tion 7r/4  and  then  move  a  distance  2  in  the  opposite  direction;  see  figure  10.1.3.  As  usual, 
a  negative  angle  9  means  an  angle  measured  clockwise  from  the  positive  x-axis.  The  point 
in  figure  10.1.3  also  has  coordinates  (2,57r/4)  and  (2,— 37r/4). 

The  relationship  between  rectangular  and  polar  coordinates  is  quite  easy  to  under- 
stand. The  point  with  polar  coordinates  (r,  9)  has  rectangular  coordinates  x  =  r  cos  9 
and  y  =  rsin^;  this  follows  immediately  from  the  definition  of  the  sine  and  cosine  func- 
tions.  Using  figure  10.1.3  as  an  example,  the  point  shown  has  rectangular  coordinates 
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-1  / 
/ 

/-I 
-2 


Figure  10.1.3     The  point  (-2,  7r/4)  =  (2,  57r/4)  =  (2,  -37r/4)  in  polar  coordinates. 

X  =  (-2)  cos(7r/4)  =  -V2  ^  1.4142  and  y  =  (-2)  sin(7r/4)  =  -V2.  This  makes  it  very- 
easy  to  convert  equations  from  rectangular  to  polar  coordinates. 


EXAMPLE  10.1.3     Find  the  equation  of  the  line  y  =  3x  +  2  in  polar  coordinates.  We 

2 

merely  substitute:  r  sin6'  =  3r  cos6'  +  2,  or  r  =  -.  □ 

sm  0  —  6  cos  0 

EXAMPLE  10.1.4  Find  the  equation  of  the  circle  (x  -  1/2)^  +  j/^  =  1/4  in  polar 
coordinates.  Again  substituting:  (r  cos^  —  1/2)^  +  sin^  9  =  1/4.  A  bit  of  algebra  turns 
this  into  r  =  cos(t).  You  should  try  plotting  a  few  (r,  ^)  values  to  convince  yourself  that 
this  makes  sense.  □ 


EXAMPLE  10.1.5  Graph  the  polar  equation  r  =  9.  Here  the  distance  from  the  origin 
exactly  matches  the  angle,  so  a  bit  of  thought  makes  it  clear  that  when  ^  >  0  we  get  the 
spiral  of  Archimedes  in  figure  10.1.4.  When  ^  <  0,  r  is  also  negative,  and  so  the  full  graph 
is  the  right  hand  picture  in  the  figure.  □ 


Converting  polar  equations  to  rectangular  equations  can  be  somewhat  trickier,  and 
graphing  polar  equations  directly  is  also  not  always  easy. 
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EXAMPLE  10.1.6  Graph  r  =  2sin^.  Because  the  sine  is  periodic,  we  know  that  we 
will  get  the  entire  curve  for  values  of  9  in  [0,27r).  As  9  runs  from  0  to  n/2,  r  increases 
from  0  to  2.  Then  as  9  continues  to  tt,  r  decreases  again  to  0.  When  9  runs  from  tt  to 
27r,  r  is  negative,  and  it  is  not  hard  to  see  that  the  first  part  of  the  curve  is  simply  traced 
out  again,  so  in  fact  we  get  the  whole  curve  for  values  of  9  in  [0,  tt).  Thus,  the  curve  looks 
something  Uke  figure  10.1.5.  Now,  this  suggests  that  the  curve  could  possibly  be  a  circle, 
and  if  it  is,  it  would  have  to  be  the  circle  x'^  +  {y  —  1)^  =  1.  Having  made  this  guess,  we 
can  easily  check  it.  First  we  substitute  for  x  and  y  to  get  (rcos6')^  +  (rsin6'  —  1)^  =  1; 
expanding  and  simplifying  does  indeed  turn  this  into  r  =  2  sin  9.  a 


Exercises  10.1. 

1.  Plot  these  polar  coordinate  points  on  one  graph:  (2,7r/3),  (— 3,7r/2),  (— 2,  — 7r/4),  (1/2, tt), 
(l,47r/3),  (0,37r/2). 

Find  an  equation  in  polar  coordinates  that  has  the  same  graph  as  the  given  equation  in  rectangular 
coordinates. 

2.  y  =  3a;  ^  3.  y  =  -4  ^ 
4.    xy^  =  1  5.  a;^  +      =  5 
6.    y  =  X   =^  7.  y  =  sina;=^ 
8.    y  =  5a;  +  2^  9.  a;  =  2^ 

10.   y  =      +  1  ^  11-  y  =  3x^  -  2a;  ^ 

12.  y  =      +  y^  =^ 

Sketch  the  curve. 

13.  r  =  cos6'  14.  r  =  sin(6l  +  7r/4) 
15.   r  =  -sec6'  16.  r  =  6'/2,  6*  >  0 
17.   r  =  l  +  6'V7i-^  18-  r  =  cot  6*  CSC  6* 


1 


19.   r  =   .   ^  ^   20.       = -2  sec  6*  CSC  6* 

sm  0  +  cos  0 
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Find  an  equation  in  rectangular  coordinates  that  has  the  same  graph  as  the  given  equation  in 
polar  coordinates. 

21.    r  =  sin(36l)  22.    r  =  sin^  0 

23.    r  =  sec  0  CSC  6*  =^>  24.    r  =  tan6'=^> 

10.2    Slopes  in  polar  coordinates 

When  we  describe  a  curve  using  polar  coordinates,  it  is  still  a  curve  in  the  x-y  plane.  We 
would  like  to  be  able  to  compute  slopes  and  areas  for  these  curves  using  polar  coordinates. 

We  have  seen  that  x  =  r  cos  9  and  y  =  r  sin  9  describe  the  relationship  between  polar 
and  rectangular  coordinates.  If  in  turn  we  are  interested  in  a  curve  given  by  r  =  f{9), 
then  we  can  write  x  =  f{9)  cos  9  and  y  =  f{9)  sin^,  describing  x  and  y  in  terms  of  9  alone. 
The  first  of  these  equations  describes  9  implicitly  in  terms  of  x,  so  using  the  chain  rule  we 
may  compute 

dy  dy  d9 
dx     d9  dx 

Since  d9/dx  =  l/{dx/d9),  we  can  instead  compute 

dy  _dy/d9  _  f{9)cos9  +  f'{9)sm9 
dx  ~  dx/d9  ~  -f{9)  sm9  +  f'{9)  cos9' 

EXAMPLE  10.2.1  Find  the  points  at  which  the  curve  given  by  r  =  1  +  cos^  has  a 
vertical  or  horizontal  tangent  line.  Since  this  function  has  period  27r,  we  may  restrict  our 
attention  to  the  interval  [0,  27r)  or  (— 7r,7r],  as  convenience  dictates.  First,  we  compute  the 
slope: 

dy  (1  +  cos  6*)  cos  6*  —  sin  6*  sin  6*  cos  6*  +  cos^  6*  —  sin^  6* 
dx      —  (1  +  cos  6*)  sin  6*  —  sin  6*  cos  6*      —  sin  6*  —  2  sin  6*  cos  6* 

This  fraction  is  zero  when  the  numerator  is  zero  (and  the  denominator  is  not  zero).  The 
numerator  is  2  cos^  9  +  cos  ^  —  1  so  by  the  quadratic  formula 

-l±v/l  +  4-2  1 

cost'  =   =  — 1    or  -. 

4  2 

This  means  ^  is  tt  or  ±7r/3.  However,  when  9  =  it,  the  denominator  is  also  0,  so  we  cannot 
conclude  that  the  tangent  line  is  horizontal. 
Setting  the  denominator  to  zero  we  get 

-9  -  2sin^cos^  =  0 
sin^(l  +  2cos^)  =  0, 

so  either  sin^  =  0  or  cos^  =  —1/2.  The  first  is  true  when  ^  is  0  or  tt,  the  second  when  9 
is  2n/3  or  4n/3.  However,  as  above,  when  9  =  tt,  the  numerator  is  also  0,  so  we  cannot 
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conclude  that  the  tangent  hne  is  vertical.  Figure  10.2.1  shows  points  corresponding  to  9 
equal  to  0,  ±7r/3,  2n/3  and  4n/3  on  the  graph  of  the  function.  Note  that  when  9  =  n  the 
curve  hits  the  origin  and  does  not  have  a  tangent  line.  □ 


Figure  10.2.1     Points  of  vertical  and  horizontal  tangency  for  r  =  1  +  cos^. 

We  know  that  the  second  derivative  f"{x)  is  useful  in  describing  functions,  namely, 
in  describing  concavity.  We  can  compute  f"{x)  in  terms  of  polar  coordinates  as  well.  We 
already  know  how  to  write  dy/dx  =  y'  in  terms  of  9,  then 

d  dy      dy'      dy'  d9  dy'/d9 


dx  dx      dx      d9  dx  dx/d9 

EXAMPLE  10.2.2    We  find  the  second  derivative  for  the  cardioid  r  =  1  +  cos  9: 
d  cos9  +  cos^9-sm'^9       1  3(1  + cos  ^)  1 


d9  —  sin 6*  —  2  sin 6*  cos  6*    dx/d9  (sin 6*  +  2  sin 6*  cos  6*) ^    —(sin 6*  +  2  sin 6*  cos  6*) 

-3(1  + cos  ^) 
(sin  9  +  2  sin  9  cos  9)^ 

The  ellipsis  here  represents  rather  a  substantial  amount  of  algebra.  We  know  from  above 
that  the  cardioid  has  horizontal  tangents  at  ±7r/3;  substituting  these  values  into  the  second 
derivative  we  get  y"{n/3)  =  —V3/2  and  ?/"(— 7r/3)  =  V^/2,  indicating  concave  down  and 
concave  up  respectively.  This  agrees  with  the  graph  of  the  function.  □ 

Exercises  10.2. 

Compute  y'  =  dy/dx  and  y"  =  d^y/dx^ . 

1.    r  =  e^  2.  r  =  l  +  sme  ^ 

3.    r  =  cos  9  4.  r  =  sinO  =^ 

5.    r  =  sec9  =^  6.  r  =  sin(20) 
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Sketch  the  curves  over  the  interval  [0,  27r]  unless  otherwise  stated. 
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10.3    Areas  in  polar  coordinates 

We  can  use  the  equation  of  a  curve  in  polar  coordinates  to  compute  some  areas  bounded 
by  such  curves.  The  basic  approach  is  the  same  as  Vk^ith  any  apphcation  of  integration:  find 
an  approximation  that  approaches  the  true  value.  For  areas  in  rectangular  coordinates,  we 
approximated  the  region  using  rectangles;  in  polar  coordinates,  we  use  sectors  of  circles, 
as  depicted  in  figure  10.3.1.  Recall  that  the  area  of  a  sector  of  a  circle  is  Q;r^/2,  where 
a  is  the  angle  subtended  by  the  sector.  If  the  curve  is  given  by  r  =  f{ff),  and  the  angle 
subtended  by  a  small  sector  is  A6',  the  area  is  (A6')(/(6'))^/2.  Thus  we  approximate  the 
total  area  as 

n—  1  ^ 

In  the  limit  this  becomes 

EXAMPLE  10.3.1     We  find  the  area  inside  the  cardioid  r  =  1  +  cos6'. 

/  -(l+cosOf  de  =  -  l+2cos6'+cos2  6'rf6'=  -  (6'  +  2sin6'+-  +  — — ) 
Jo     2  2  io  2  '  2  4' 

□ 


EXAMPLE  10.3.2  We  find  the  area  between  the  circles  r  =  2  and  r  =  4sin^,  as  shown 
in  figure  10.3.2.  The  two  curves  intersect  where  2  =  4sin^,  or  sin^  =  1/2,  so  ^  =  7r/6  or 
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Figure  10.3.1     Approximating  area  by  sectors  of  circles. 
57r/6.  The  area  we  want  is  then 


Figure  10.3.2     An  area  between  curves. 

This  example  makes  the  process  appear  more  straightforward  than  it  is.  Because 
points  have  many  different  representations  in  polar  coordinates,  it  is  not  always  so  easy  to 
identify  points  of  intersection. 

EXAMPLE  10.3.3  We  find  the  shaded  area  in  the  first  graph  of  figure  10.3.3  as  the 
difference  of  the  other  two  shaded  areas.  The  cardioid  is  r  =  1  +  sin^  and  the  circle  is 
r  =  3sin^.  We  attempt  to  find  points  of  intersection: 

1  +  sin^  =  3sin^ 
1  =  2sme 
1/2  =  sine. 

This  has  solutions  9  =  n/6  and  Stt/G;  tt/G  corresponds  to  the  intersection  in  the  first  quad- 
rant that  we  need.  Note  that  no  solution  of  this  equation  corresponds  to  the  intersection 
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point  at  the  origin,  but  fortunately  that  one  is  obvious.  The  cardioid  goes  through  the 
origin  when  9  =  — 7r/2;  the  circle  goes  through  the  origin  at  multiples  of  tt,  starting  with 
0. 

Now  the  larger  region  has  area 


and  the  smaller  has  area 

-  /    {3sme)Ue  =  —  --V3 

2  Jq  8  16 

so  the  area  we  seek  is  tt/S.  □ 


Figure  10.3.3     An  area  between  curves. 


Exercises  10.3. 


Find  the  area  enclosed  by  the  curve. 
1.    r  =  A/sin6'  =^  2.    r  =  2  +  cos0=> 

3.    r  =  sec6',7r/6  <  6*  <  7r/3  ^  4.    r  =  cos  6*,  0  <  6*  <  7r/3  ^ 

5.    r  =  2acos0,  a  >  0  ^  6.    r  =  4  +  3  sin  0  ^ 

7.  Find  the  area  inside  the  loop  formed  by  r  =  tan(0/2).  =^> 

8.  Find  the  area  inside  one  loop  of  r  =  cos(30).  =^ 

9.  Find  the  area  inside  one  loop  of  r  =  sin'^  0.  =^ 

10.  Find  the  area  inside  the  small  loop  of  r  =  (1/2)  +  cosO.  =f- 

11.  Find  the  area  inside  r  =  (1/2)  +  cosO,  including  the  area  inside  the  small  loop. 

12.  Find  the  area  inside  one  loop  of      =  cos(20). 

CSC  9 


13.  Find  the  area  enclosed  by  r  =  tan0  and  r 


V2 


250     Chapter  10  Polar  Coordinates,  Parametric  Equations 


14.  Find  the  area  inside  r  =  2  cos  9  and  outside  r  =  1.  =^ 

15.  Find  the  area  inside  r  =  2sin0  and  above  the  line  r  =  (3/2)  esc  9.  =^ 

16.  Find  the  area  inside  r  =  9,  0  <  9  <  2tv. 

17.  Find  the  area  inside  r  =  V9,  0  <  9  <  27r.  =^ 

18.  Find  the  area  inside  both  r  =  \/3cos0  and  r  =  sin  9.  =^ 

19.  Find  the  area  inside  both  r  =  1  —  cos  9  and  r  =  cos  9. 

20.  The  center  of  a  circle  of  radius  1  is  on  the  circumference  of  a  circle  of  radius  2.  Find  the  area 
of  the  region  inside  both  circles.  =^ 

21.  Find  the  shaded  area  in  figure  10.3.4.  The  curve  is  r  =  6,  0  <  9  <  Stt.  =^ 


Figure  10.3.4     An  area  bounded  by  the  spiral  of  Archimedes. 

10.4    Parametric  Equations 

When  we  computed  the  derivative  dy/dx  using  polar  coordinates,  we  used  the  expressions 
X  =  f  (9)  cos  9  and  y  =  f(9)siii9.  These  two  equations  completely  specify  the  curve, 
though  the  form  r  =  f{9)  is  simpler.  The  expanded  form  has  the  virtue  that  it  can  easily 
be  generalized  to  describe  a  wider  range  of  curves  than  can  be  specified  in  rectangular  or 
polar  coordinates. 

Suppose  f{t)  and  g{t)  are  functions.  Then  the  equations  x  =  f{t)  and  y  =  g{t) 
describe  a  curve  in  the  plane.  In  the  case  of  the  polar  coordinates  equations,  the  variable 
t  is  replaced  by  9  which  has  a  natural  geometric  interpretation.  But  t  in  general  is  simply 
an  arbitrary  variable,  often  called  in  this  case  a  parameter,  and  this  method  of  specifying 
a  curve  is  known  as  parametric  equations.  One  important  interpretation  of  t  is  time. 
In  this  interpretation,  the  equations  x  =  f{t)  and  y  =  g{t)  give  the  position  of  an  object 
at  time  t. 
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EXAMPLE  10.4.1  Describe  the  path  of  an  object  that  moves  so  that  its  position  at 
time  t  is  given  by  x  =  cost,  y  =  cos^t.  We  see  immediately  that  y  =  ,  so  the  path  hes 
on  this  parabola.  The  path  is  not  the  entire  parabola,  however,  since  x  =  cost  is  always 
between  —1  and  1.  It  is  now  easy  to  see  that  the  object  oscillates  back  and  forth  on  the 
parabola  between  the  endpoints  (1, 1)  and  (—1, 1),  and  is  at  point  (1, 1)  at  time  t  =  0.  □ 

It  is  sometimes  quite  easy  to  describe  a  complicated  path  in  parametric  equations 
when  rectangular  and  polar  coordinate  expressions  are  difficult  or  impossible  to  devise. 

EXAMPLE  10.4.2  A  wheel  of  radius  1  rolls  along  a  straight  line,  say  the  x-axis.  A 
point  on  the  rim  of  the  wheel  will  trace  out  a  curve,  called  a  cycloid.  Assume  the  point 
starts  at  the  origin;  find  parametric  equations  for  the  curve. 

Figure  10.4.1  illustrates  the  generation  of  the  curve  (click  on  the  AP  link  to  see  an 
animation).  The  wheel  is  shown  at  its  starting  point,  and  again  after  it  has  rolled  through 
about  490  degrees.  We  take  as  our  parameter  t  the  angle  through  which  the  wheel  has 
turned,  measured  as  shown  clockwise  from  the  line  connecting  the  center  of  the  wheel 
to  the  ground.  Because  the  radius  is  1,  the  center  of  the  wheel  has  coordinates  (t,  1). 
We  seek  to  write  the  coordinates  of  the  point  on  the  rim  as  [t  +  Ax,  1  +  Ay),  where 
Ax  and  Ay  are  as  shown  in  figure  10.4.2.  These  values  are  nearly  the  sine  and  cosine 
of  the  angle  t,  from  the  unit  circle  definition  of  sine  and  cosine.  However,  some  care  is 
required  because  we  are  measuring  t  from  a  nonstandard  starting  line  and  in  a  clockwise 
direction,  as  opposed  to  the  usual  counterclockwise  direction.  A  bit  of  thought  reveals 
that  Ax  =  —  sint  and  Ay  =  —  cost.  Thus  the  parametric  equations  for  the  cycloid  are 
X  =  t  —  sint,  y  =  1  —  cost.  □ 


Figure  10.4.1     A  cycloid. 


Figure  10.4.2     The  wheel. 
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Exercises  10.4- 

1.  What  curve  is  described  hy  x  =  t'^ ,  y  =  t"^?  If  t  is  interpreted  as  time,  describe  how  the 
object  moves  on  the  curve. 

2.  What  curve  is  described  by  a;  =  3cosi,  y  =  3sini?  If  t  is  interpreted  as  time,  describe  how 
the  object  moves  on  the  curve. 

3.  What  curve  is  described  by  a;  =  3 cost,  y  =  2sint?  If  t  is  interpreted  as  time,  describe  how 
the  object  moves  on  the  curve. 

4.  What  curve  is  described  by  a;  =  3sint,  y  =  3 cost?  If  t  is  interpreted  as  time,  describe  how 
the  object  moves  on  the  curve. 

5.  Sketch  the  curve  described  by  x  =  t"^  —  t,  y  =  t^.  If  t  is  interpreted  as  time,  describe  how  the 
object  moves  on  the  curve. 

6.  A  wheel  of  radius  1  rolls  along  a  straight  line,  say  the  x-axis.  A  point  P  is  located  halfway 
between  the  center  of  the  wheel  and  the  rim;  assume  P  starts  at  the  point  (0, 1/2).  As  the 
wheel  rolls,  P  traces  a  curve;  find  parametric  equations  for  the  curve. 

7.  A  wheel  of  radius  1  rolls  around  the  outside  of  a  circle  of  radius  3.  A  point  P  on  the  rim  of 
the  wheel  traces  out  a  curve  called  a  hypercycloid,  as  indicated  in  figure  10.4.3.  Assuming 
P  starts  at  the  point  (3,0),  find  parametric  equations  for  the  curve. 


Figure  10.4.3     A  hypercycloid  and  a  hypocycloid. 

8.  A  wheel  of  radius  1  rolls  around  the  inside  of  a  circle  of  radius  3.  A  point  P  on  the  rim  of 
the  wheel  traces  out  a  curve  called  a  hypocycloid,  as  indicated  in  figure  10.4.3.  Assuming 
P  starts  at  the  point  (3,0),  find  parametric  equations  for  the  curve.  =^ 

9.  An  involute  of  a  circle  is  formed  as  follows:  Imagine  that  a  long  (that  is,  infinite)  string  is 
wound  tightly  around  a  circle,  and  that  you  grasp  the  end  of  the  string  and  begin  to  unwind 
it,  keeping  the  string  taut.  The  end  of  the  string  traces  out  the  involute.  Find  parametric 
equations  for  this  curve,  using  a  circle  of  radius  1,  and  assuming  that  the  string  unwinds 
counter-clockwise  and  the  end  of  the  string  is  initially  at  (1,0).  Figure  10.4.4  shows  part  of 
the  curve;  the  dotted  lines  represent  the  string  at  a  few  different  times.  =^ 
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Figure  10.4.4     An  involute  of  a  circle. 

10.5    Calculus  with  Parametric  Equations 

We  have  already  seen  how  to  compute  slopes  of  curves  given  by  parametric  equations — it 
is  how  we  computed  slopes  in  polar  coordinates. 

EXAMPLE  10.5.1  Find  the  slope  of  the  cycloid  x  —  t  — sint,  y  =  1  — cost.  We  compute 
x'  —  1  —  cost,  y'  —  sint,  so 

dy         sin  t 
dx      1  —  cos  t 

Note  that  when  t  is  an  odd  multiple  of  tt,  like  n  or  Stt,  this  is  (0/2)  =  0,  so  there  is 
a  horizontal  tangent  line,  in  agreement  with  figure  10.4.1.  At  even  multiples  of  tt,  the 
fraction  is  0/0,  which  is  undefined.  The  figure  shows  that  there  is  no  tangent  line  at  such 
points.  □ 

Areas  can  be  a  bit  trickier  with  parametric  equations,  depending  on  the  curve  and  the 
area  desired.  We  can  potentially  compute  areas  between  the  curve  and  the  x-axis  quite 
easily. 

EXAMPLE  10.5.2  Find  the  area  under  one  arch  of  the  cycloid  a:  =  t— sint,  y  =  1  — cost. 
We  would  like  to  compute 

J    y  dx, 

but  we  do  not  know  y  in  terms  of  x.  However,  the  parametric  equations  allow  us  to  make 
a  substitution:  use  y  =  1  —  cost  to  replace  y,  and  compute  dx  =  {1  —  cost)  dt.  Then  the 
integral  becomes 

/.27r 

/     (1  —  cost)(l  —  cost)  dt  =  Stt. 
Jo 
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Note  that  we  need  to  convert  the  original  x  hmits  to  t  hmits  using  x  =  t  —  sint.  When 
a;  =  0,  t  =  sint,  which  happens  only  when  t  =  0.  Likewise,  when  x  =  2n,  t  —  2n  =  sint 
and  t  =  2tv.  Alternately,  because  we  understand  how  the  cycloid  is  produced,  we  can  see 
directly  that  one  arch  is  generated  by  0  <  t  <  2n.  In  general,  of  course,  the  t  limits  will 
be  different  than  the  x  limits.  □ 

This  technique  will  allow  us  to  compute  some  quite  interesting  areas,  as  illustrated  by 
the  exercises. 

As  a  final  example,  we  see  how  to  compute  the  length  of  a  curve  given  by  parametric 
equations.  Section  9.9  investigates  arc  length  for  functions  given  as  y  in  terms  of  x,  and 
develops  the  formula  for  length: 


a 


Using  some  properties  of  derivatives,  including  the  chain  rule,  we  can  convert  this  to  use 
parametric  equations  x  =  f{t),  y  =  g{t): 


'^^  /  J  a   \  \dt  J       \dt  J    \dx  J  dx 


'("^y dt 


u   \  \dt  J  \dt 


V 


^/IrW+WWdt. 

Here  u  and  v  are  the  t  limits  corresponding  to  the  x  limits  a  and  b. 

EXAMPLE  10.5.3  Find  the  length  of  one  arch  of  the  cycloid.  From  x  =  t  —  sint, 
y  =  1  —  cost,  we  get  the  derivatives  /'  =  1  —  cost  and  g'  =  sint,  so  the  length  is 

/»27r      I   /•27r 

/     J{l-cost)^  +  sm'^tdt=  /     V2  -  2 cost  dt. 
Jo  Jo 

Now  we  use  the  formula  sin^(t/2)  =  (1  —  cos(t))/2  or  4sin^(t/2)  =  2  —  2 cost  to  get 


0 


'4sin^(t/2)  dt. 
Since  0  <  t  <  27r,  sin(t/2)  >  0,  so  we  can  rewrite  this  as 


/.27r 

/  2sin(t/2)  dt  =  8. 
Jo 


□ 
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Exercises  10.5. 

1.  Consider  the  curve  of  exercise  6  in  section  10.4.  Find  all  values  of  t  for  which  the  curve  has 
a  horizontal  tangent  line.  =^> 

2.  Consider  the  curve  of  exercise  6  in  section  10.4.  Find  the  area  under  one  arch  of  the  curve. 

=^ 

3.  Consider  the  curve  of  exercise  6  in  section  10.4.  Set  up  an  integral  for  the  length  of  one  arch 
of  the  curve.  =^> 

4.  Consider  the  hypercycloid  of  exercise  7  in  section  10.4.  Find  all  points  at  which  the  curve 
has  a  horizontal  tangent  line.  =J> 

5.  Consider  the  hypercycloid  of  exercise  7  in  section  10.4.  Find  the  area  between  the  large  circle 
and  one  arch  of  the  curve.  =f- 

6.  Consider  the  hypercycloid  of  exercise  7  in  section  10.4.  Find  the  length  of  one  arch  of  the 
curve.  =^ 

7.  Consider  the  hypocycloid  of  exercise  8  in  section  10.4.  Find  the  area  inside  the  curve.  =^ 

8.  Consider  the  hypocycloid  of  exercise  8  in  section  10.4.  Find  the  length  of  one  arch  of  the 
curve. 

9.  Recall  the  involute  of  a  circle  from  exercise  9  in  section  10.4.   Find  the  point  in  the  first 
quadrant  in  figure  10.4.4  at  which  the  tangent  line  is  vertical.  =^> 

10.  Recall  the  involute  of  a  circle  from  exercise  9  in  section  10.4.  Instead  of  an  infinite  string, 
suppose  we  have  a  string  of  length  tt  attached  to  the  unit  circle  at  (—1,0),  and  initially  laid 
around  the  top  of  the  circle  with  its  end  at  (1,0).  If  we  grasp  the  end  of  the  string  and  begin 
to  unwind  it,  we  get  a  piece  of  the  involute,  until  the  string  is  vertical.  If  we  then  keep  the 
string  taut  and  continue  to  rotate  it  counter-clockwise,  the  end  traces  out  a  semi-circle  with 
center  at  (—1,0),  until  the  string  is  vertical  again.  Continuing,  the  end  of  the  string  traces 
out  the  mirror  image  of  the  initial  portion  of  the  curve;  see  figure  10.5.1.  Find  the  area  of 
the  region  inside  this  curve  and  outside  the  unit  circle.  =^ 

11.  Find  the  length  of  the  curve  from  the  previous  exercise,  shown  in  figure  10.5.1.  =f- 

12.  Find  the  length  of  the  spiral  of  Archimedes  (figure  10.3.4)  for  0  <  0  <  27r. 


Figure  10.5.1     A  region  formed  by  the  end  of  a  string. 
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Consider  the  following  sum: 

1111  1 

The  dots  at  the  end  indicate  that  the  sum  goes  on  forever.  Does  this  make  sense?  Can 
we  assign  a  numerical  value  to  an  infinite  sum?  While  at  first  it  may  seem  difficult  or 
impossible,  we  have  certainly  done  something  similar  when  we  talked  about  one  quantity 
getting  "closer  and  closer"  to  a  fixed  quantity.  Here  we  could  ask  whether,  as  we  add  more 
and  more  terms,  the  sum  gets  closer  and  closer  to  some  fixed  value.  That  is,  look  at 

1 

2 

3 

4 

7 

8 
15 
16 

and  so  on,  and  ask  whether  these  values  have  a  limit.  It  seems  pretty  clear  that  they  do, 
namely  1.  In  fact,  as  we  will  see,  it's  not  hard  to  show  that 

111      1  1 


1 
2 

1  1 

2  +  4 

1  1  1 

2  +  4  +  8 

1111 

2  ^  4  ^  8  ^  16 
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and  then 


lim        —  =  1-  0  =  1. 


There  is  one  place  that  you  have  long  accepted  this  notion  of  infinite  sum  without  really 
thinking  of  it  as  a  sum: 


^ooooo      3        3         3  3  1 

0.33333  =  —  H  H  H  +  ...  =  -, 

10     100     1000     10000  3' 


for  example,  or 


o-..-..^         o      1        4         1  5  9 

3.14159...  =  3H  \  \  \  1  1  =  TT. 

10     100     1000     10000  100000 

Our  first  task,  then,  to  investigate  infinite  sums,  called  series,  is  to  investigate  limits  of 
sequences  of  numbers.  That  is,  we  officially  call 

^  1  _  1      1     1      1  1 

i—l 


a  series,  while 


is  a  sequence,  and 


1  3  7  15        2*  -  1 


2'4'8'16'      '    2»  ' 
2»  -  1 


lim 


2«     i^oo  2* 

i=l 


that  is,  the  value  of  a  series  is  the  limit  of  a  particular  sequence. 


11.1  Sequences 

While  the  idea  of  a  sequence  of  numbers,  ai,  a2,  as, . . .  is  straightforward,  it  is  useful  to 
think  of  a  sequence  as  a  function.  We  have  up  until  now  dealt  with  functions  whose  domains 
are  the  real  numbers,  or  a  subset  of  the  real  numbers,  like  f{x)  =  sinx.  A  sequence  is  a 
function  with  domain  the  natural  numbers  N  =  {1,  2,  3, . . .}  or  the  non-negative  integers, 
=  {0,1,2,3,...}.  The  range  of  the  function  is  still  allowed  to  be  the  real  numbers;  in 
symbols,  we  say  that  a  sequence  is  a  function  f-.N^M..  Sequences  are  written  in  a  few 
different  ways,  all  equivalent;  these  all  mean  the  same  thing: 

oi,  02,  as, . . . 

As  with  functions  on  the  real  numbers,  we  will  most  often  encounter  sequences  that 
can  be  expressed  by  a  formula.  We  have  already  seen  the  sequence     =  /(z)  =  1  —  1/2*, 
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and  others  are  easy  to  come  by: 

fin 


i  +  1 
1 

f{n)  =  sin(n7r/6) 


m 

Frequently  these  formulas  will  make  sense  if  thought  of  either  as  functions  with  domain 
or  N,  though  occasionally  one  will  make  sense  only  for  integer  values. 
Faced  with  a  sequence  we  are  interested  in  the  limit 


lim  f{i)  =  lim  a^. 

i—^oo  i—^oo 

We  already  understand 

lim  f{x) 

x—^oo 

when  a;  is  a  real  valued  variable;  now  we  simply  want  to  restrict  the  "input"  values  to  be 
integers.  No  real  difference  is  required  in  the  definition  of  limit,  except  that  we  specify,  per- 
haps implicitly,  that  the  variable  is  an  integer.  Compare  this  definition  to  definition  4.10.2. 

DEFINITION  11.1.1  Suppose  that  {a^}°°^i  is  a  sequence.  We  say  that  lim  =  L 
if  for  every  e  >  0  there  is  an      >  0  so  that  whenever  n  >  N,       —  L|  <  e.  If  lim      =  L 

71— >00 

we  say  that  the  sequence  converges,  otherwise  it  diverges.  □ 

If  /(z)  defines  a  sequence,  and  /(x)  makes  sense,  and  lim  /(x)  =  L,  then  it  is  clear 
that  lim  /(z)  =  L  as  well,  but  it  is  important  to  note  that  the  converse  of  this  statement 

i— >-oo 

is  not  true.  For  example,  since  Um  (1/x)  =  0,  it  is  clear  that  also  lim  (1/z)  =  0,  that  is, 
the  numbers 

111111 
1'  2'  3'  4'  5'  6''" 

get  closer  and  closer  to  0.  Consider  this,  however:  Let  f{n)  =  sin(77.7r).  This  is  the  sequence 

sin(07r),  sin(l7r),  sin(27r),  sin(37r), . .  .  =  0,  0,  0,  0, . . . 
since  sin(n7r)  =  0  when  n  is  an  integer.  Thus  lim  /(n)  =  0.  But  lim  /(x),  when  x  is 

n—^oo  x—^oo 

real,  does  not  exist:  as  x  gets  bigger  and  bigger,  the  values  sin(x7r)  do  not  get  closer  and 
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closer  to  a  single  value,  but  take  on  all  values  between  —1  and  1  over  and  over.  In  general, 
whenever  you  want  to  know  lim  /(n)  you  should  first  attempt  to  compute  lim  /(x), 

n—^oo  x—^oo 

since  if  the  latter  exists  it  is  also  equal  to  the  first  limit.  But  if  for  some  reason  lim  f{x) 
does  not  exist,  it  may  still  be  true  that  lim  /(n)  exists,  but  you'll  have  to  figure  out 

n—^oo 

another  way  to  compute  it. 

It  is  occasionally  useful  to  think  of  the  graph  of  a  sequence.  Since  the  function  is 
defined  only  for  integer  values,  the  graph  is  just  a  sequence  of  dots.  In  figure  11.1.1  we  see 
the  graphs  of  two  sequences  and  the  graphs  of  the  corresponding  real  functions. 


fin)  =  1/n 


1/x 


10 


10 


/(n)  =  sin(n7r) 


-t  1  1  1  1  f  1  1 

12345678 


f{x)  =  sin(x7r) 


Figure  11.1.1     Graphs  of  sequences  and  their  corresponding  real  functions. 

Not  surprisingly,  the  properties  of  limits  of  real  functions  translate  into  properties  of 
sequences  quite  easily.  Theorem  2.3.6  about  limits  becomes 


THEOREM  11.1.2   Suppose  that  lim  an  —  L  and  lim  bn  —  M  and  k  is  some  constant. 

n—^oo  n— >oo 

Then 


lim  kan  =  k  lim       =  kL 

n— >oo  n— >oo 


lim  (ttn  +  bn)  =  lim  an  +  lim  bn  =  L  +  M 


n— >-cxD  n— >-cxD  n— >cxd 


lim  (ttn  —  br 


lim  ttn  —  lim  6„  =  L  —  M 


'"n  J         111J.J.  u,n         lilh  un 

n— >-oo  n— >oo  n— >oo 

lim  (ttnbn)  =  lim      ■  lim  bn  =  LM 

n—^oo  n—^oo  n—^oo 


lim 


O-n  lini„  _i.no  Q-n  L 


n^oo  bn         lim^^oo  K  M 


if  M  is  not  0 


Likewise  the  Squeeze  Theorem  (4.3.1)  becomes 
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THEOREM  11.1.3    Suppose  that      <  6n  <      for  all  n>  N,  for  some  A^.  If  lim  = 


lim  Cn  =  L,  then  lim  bn  =  L.  ■ 

n— >oo  n— >oo 

And  a  final  useful  fact: 
THEOREM  11.1.4      lim  ja^l  =  0  if  and  only  if  lim       =  0.  ■ 

rt— >-oo  n— >oo 

This  says  simply  that  the  size  of  gets  close  to  zero  if  and  only  if  gets  close  to 
zero. 

EXAMPLE  11.1.5    Determine  whether  <    >       converges  or  diverges.  If  it  con- 

W  +  1  Jn=0 

verges,  compute  the  limit.  Since  this  makes  sense  for  real  numbers  we  consider 

x  1 
lim    =  lim  1  =  1  —  0  =  1. 

x^oo  X  +  1        x^oo  X  +  1 

Thus  the  sequence  converges  to  1.  □ 

f  In  77.  "1  ^ 

EXAMPLE  11.1.6    Determine  whether  <   >       converges  or  diverges.  If  it  con- 


77, 


n=l 


verges,  compute  the  limit.  We  compute 


Inx  1/x 
lim           =  lim           =  0, 


x—^oo     X  X— >oo 


using  L'Hopital's  Rule.  Thus  the  sequence  converges  to  0.  □ 

EXAMPLE  11.1.7  Determine  whether  {(— l)"}5^o  converges  or  diverges.  If  it  con- 
verges, compute  the  limit.  This  does  not  make  sense  for  all  real  exponents,  but  the  sequence 
is  easy  to  understand:  it  is 

1,-1,1,-1,1... 

and  clearly  diverges.  □ 

EXAMPLE  11.1.8  Determine  whether  {(— 1/2)"}5^q  converges  or  diverges.  If  it  con- 
verges, compute  the  limit.  We  consider  the  sequence  {|(-1/2)"^|}5^q  =  {(l/2)'^}5^o-  Then 

fiy  1 

lim     -      =  lim  —  =0, 

X— >-CXD    \  2  /  X— >-CXD  2^ 

SO  by  theorem  11.1.4  the  sequence  converges  to  0.  □ 
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EXAMPLE  11.1.9  Determine  whether  {(sinn)/v^}5^i  converges  or  diverges.  If  it 
converges,  compute  the  hmit.  Since  |  sinn|  <  1,  0  <  |  sinn/v^|  <  l/y/n  and  we  can  use 
theorem  11.1.3  with      =  0  and  c^t  =  Xj^fn.  Since  Hm       =  hm  Cn  =  0,  hm  sinn/v^  = 

0  and  the  sequence  converges  to  0.  □ 

EXAMPLE  11.1.10  A  particularly  common  and  useful  sequence  is  {r"^}^Q,  for  various 
values  of  r.  Some  are  quite  easy  to  understand:  If  r  =  1  the  sequence  converges  to  1  since 
every  term  is  1,  and  likewise  if  r  =  0  the  sequence  converges  to  0.  If  r  =  —  1  this  is 
the  sequence  of  example  11.1.7  and  diverges.  Ifr>lorr<— 1  the  terms  get  large 
without  limit,  so  the  sequence  diverges.  If  0  <  r  <  1  then  the  sequence  converges  to  0. 
If  —1  <  r  <  0  then  \r'^\  =  |r|"  and  0  <  |r|  <  1,  so  the  sequence  {l^'PjJ^o  converges  to 
0,  so  also  {r"^}5^o  converges  to  0.  converges.  In  summary,  {r"^}  converges  precisely  when 
—  1  <  r  <  1  in  which  case 

0    if  -1<  r  <  1 

□ 


lim  : 

n^oo  LI    if  r  =  1 


Sometimes  we  will  not  be  able  to  determine  the  limit  of  a  sequence,  but  we  still  would 
like  to  know  whether  it  converges.  In  some  cases  we  can  determine  this  even  without  being 
able  to  compute  the  limit. 

A  sequence  is  called  increasing  or  sometimes  strictly  increasing  if      <  a^+i  for 
all  i.  It  is  called  non-decreasing  or  sometimes  (unfortunately)  increasing  if  ai  <  a^+i 
for  all  i.  Similarly  a  sequence  is  decreasing  if     >  ai_|_i  for  all  i  and  non-increasing  if 
>  CLi+i  for  all  i.  If  a  sequence  has  any  of  these  properties  it  is  called  monotonic. 

EXAMPLE  11.1.11     The  sequence 

2^  -  1 1  °°       1  3  7  15 


is  increasing,  and 


.  ,      2'  4'  8'  16' 

1=1 


n  +  ll°°       2  3  4  5 


n     I  .  1      1'  2'  3'  4''" 


is  decreasing.  □ 

A  sequence  is  bounded  above  if  there  is  some  number  N  such  that  an  <  N  for  every 
n,  and  bounded  below  if  there  is  some  number  such  that  an  >  N  for  every  n.  If  a 
sequence  is  bounded  above  and  bounded  below  it  is  bounded.  If  a  sequence  {flnj^^o 
increasing  or  non-decreasing  it  is  bounded  below  (by  ao),  and  if  it  is  decreasing  or  non- 
increasing  it  is  bounded  above  (by  ao).  Finally,  with  all  this  new  terminology  we  can  state 
an  important  theorem. 
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THEOREM  11.1.12    If  a  sequence  is  bounded  and  monotonic  then  it  converges.  ■ 

We  will  not  prove  this;  the  proof  appears  in  many  calculus  books.  It  is  not  hard  to 
believe:  suppose  that  a  sequence  is  increasing  and  bounded,  so  each  term  is  larger  than  the 
one  before,  yet  never  larger  than  some  fixed  value  A^.  The  terms  must  then  get  closer  and 
closer  to  some  value  between  oq  and  A^.  It  need  not  be  A^,  since  may  be  a  "too-generous" 
upper  bound;  the  limit  will  be  the  smallest  number  that  is  above  all  of  the  terms  a^. 

EXAMPLE  11.1.13  All  of  the  terms  {T  -  l)/2^  are  less  than  2,  and  the  sequence  is 
increasing.  As  we  have  seen,  the  limit  of  the  sequence  is  1 — 1  is  the  smallest  number  that 
is  bigger  than  all  the  terms  in  the  sequence.  Similarly,  all  of  the  terms  (n  +  l)/n  are  bigger 
than  1/2,  and  the  limit  is  1 — 1  is  the  largest  number  that  is  smaller  than  the  terms  of  the 
sequence.  □ 

We  don't  actually  need  to  know  that  a  sequence  is  monotonic  to  apply  this  theorem — 
it  is  enough  to  know  that  the  sequence  is  "eventually"  monotonic,  that  is,  that  at  some 
point  it  becomes  increasing  or  decreasing.  For  example,  the  sequence  10,  9,  8,  15,  3,  21,  4, 
3/4,  7/8,  15/16,  31/32, ...  is  not  increasing,  because  among  the  first  few  terms  it  is  not. 
But  starting  with  the  term  3/4  it  is  increasing,  so  the  theorem  tells  us  that  the  sequence 
3/4,  7/8, 15/16,  31/32, .. .  converges.  Since  convergence  depends  only  on  what  happens  as 
n  gets  large,  adding  a  few  terms  at  the  beginning  can't  turn  a  convergent  sequence  into  a 
divergent  one. 

EXAMPLE  11.1.14    Show  that  {n^l"^}  converges. 

We  first  show  that  this  sequence  is  decreasing,  that  is,  that  n^^"  >  (n  +  l)^/*-"^^^-'.  Consider 
the  real  function  f{x)  =  x^l^  when  a;  >  1.  We  can  compute  the  derivative,  f'{x)  = 
a;^'^^(l— lnx)/x^,  and  note  that  when  a:  >  3  this  is  negative.  Since  the  function  has  negative 
slope,  n^/"  >  (n  +  l)V('^+i)  ^hen  n  >  3.  Since  all  terms  of  the  sequence  are  positive,  the 
sequence  is  decreasing  and  bounded  when  n  >  3,  and  so  the  sequence  converges.  (As  it 
happens,  we  can  compute  the  limit  in  this  case,  but  we  know  it  converges  even  without 
knowing  the  limit;  see  exercise  1.)  □ 

EXAMPLE  11.1.15     Show  that  {nl/n'"}  converges. 

Again  we  show  that  the  sequence  is  decreasing,  and  since  each  term  is  positive  the  sequence 
converges.  We  can't  take  the  derivative  this  time,  as  x\  doesn't  make  sense  for  x  real.  But 
we  note  that  if  a^+i/a^  <  1  then  a„+i  <  a„,  which  is  what  we  want  to  know.  So  we  look 
at  an^xjan- 


a^-i-i        (ti  +  I)!  (n  +  1)!  n  +  1  f    ti    \"      /  n 


an        {n  +  lY+^  n\  n\      (n  +  l)'^+i      n  +  lV^  +  1/        \n  +  l 


<  1. 
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(Again  it  is  possible  to  compute  the  limit;  see  exercise  2.)  □ 
Exercises  11.1. 


1.  Compute  lim  x 


n! 

2.  Use  the  squeeze  theorem  to  show  that  hm  — '-  =  0. 

3.  Determine  whether  {\/ n  +  47  —  \Ai}n^o  converges  or  diverges.  If  it  converges,  compute  the 
hmit.  =f- 

(       2    I         'I  ^ 

4.  Determine  whether  <  —  >       converges  or  diverges.  If  it  converges,  compute  the  hmit. 

^  l(^  +  l)'J„=o 

f    n  +  47  ]°° 

5.  Determine  whether  <  —  >        converges  or  diverges.    If  it  converges,  compute  the 

I         +  3n  J 

hmit.  =^ 

6.  Determine  whether  <  — -  >       converges  or  diverges.  =^ 

I  ^!  i  n=0 

11.2  Series 

While  much  more  can  be  said  about  sequences,  we  now  turn  to  our  principal  interest, 
series.  Recall  that  a  series,  roughly  speaking,  is  the  sum  of  a  sequence:  if  {an}^=o  ^ 
sequence  then  the  associated  series  is 


ttn  =  ao  +  ai  +  a2  +  ■  ■  ■ 


i=0 


Associated  with  a  series  is  a  second  sequence,  called  the  sequence  of  partial  sums 


n 

i=0 


So 

So  =  ao,     si  =  ao  +  ai,    S2  =  ao  +  ai  +  a2, 

A  series  converges  if  the  sequence  of  partial  sums  converges,  and  otherwise  the  series 
diverges. 

oo 

EXAMPLE  11.2.1    If  ttn  =  kx"",  is  called  a  geometric  series.  A  typical  partial 

n=0 

sum  is 


Sn  =  k  +  kx  +  kx''  +  kx'^  -\  h  kx"-  =  k{l  +  x  +  x"^  +  x^  -\  h  x 
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We  note  that 

-  x)  =  k{l  +  X  +  x'^  +     -i  h  -  x) 

^  k{l  +  X  +  x'^  +  x^  -{  [-  x"")!  -  k{l  +  X  +  x'^  +  x^  -{  h  x""^  + 

^  k{l  +  X  +  x'^  +  x^  -{  h  x'^  -  X  -     -   x"  -  x^'^^) 


so 


-x)  =  A;(l  -x'^+^) 
1  - 


■Sn,  —  k- 


1  —  X 


If  |x|  <  1,  lim  x""  =  0  so 

n— >-oo 


]^  _  x^~^^  1 

lim      =  lim  k   =  k  . 

n— >-oo  n^oo         1  —  x  1  —  X 

Thus,  when  |x|  <  1  the  geometric  series  converges  to  k/{l  —  x).  When,  for  example,  k  =  1 
and  X  =  1/2: 

1  -  (1/2)"+^      2"+^  -  1  ^         ,     X  ■  - 

-  -2-—    and>—  =  —  =  2. 


1  -  1/2  2^  2^^  ^  2"      1  -  1/2 

'  n=0  ' 


—    and  —  =   

2n  1-1/ 


We  began  the  chapter  with  the  series 

or 


2' 

n=l 


namely,  the  geometric  series  without  the  first  term  1.  Each  partial  sum  of  this  series  is  1 
less  than  the  corresponding  partial  sum  for  the  geometric  series,  so  of  course  the  limit  is 
also  one  less  than  the  value  of  the  geometric  series,  that  is. 


oo  ^ 
Z-^  On 


2 

n=l 

□ 

It  is  not  hard  to  see  that  the  following  theorem  follows  from  theorem  11.1.2. 

THEOREM  11.2.2  Suppose  that  '^an  and  J^^n  are  convergent  series,  and  c  is  a 
constant.  Then 


1.  ^^ca^  is  convergent  and  ^^ca^  =  c^^a^ 
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2.  ^(on  +  bn)  is  convergent  and  ^(a^  +  bn)  =  ^     +  ^  bn- 

■ 

The  two  parts  of  this  theorem  are  subtly  different.  Suppose  that  ^  a„  diverges;  does 
cttn  also  diverge  if  c  is  non-zero?  Yes:  suppose  instead  that  ^  can  converges;  then  by 
the  theorem,  ^(l/c)can  converges,  but  this  is  the  same  as  ^a^,  which  by  assumption 
diverges.  Hence  Yl  ccin  also  diverges.  Note  that  we  are  applying  the  theorem  with  a„ 
replaced  by  ca„  and  c  replaced  by  (1/c). 

Now  suppose  that  Y'^n  and  J2bn  diverge;  does  Yi'^n  +  bn)  also  diverge?  Now  the 
answer  is  no:  Let  =  1  and  6^  =  —  1,  so  certainly  Yl  '^n  and  Y  bn  diverge.  But  Yi'^n  + 
bn)  =  Yi^  "I — 1)  =  XI  0  =  0-  C)f  course,  sometimes  +  ^n)  will  also  diverge,  for 

example,  if  an  =  bn  =  1,  then  Yi'^n  +  ^n)  =  Yi^  +  1)  =  X]  2  diverges. 

In  general,  the  sequence  of  partial  sums  Sn  is  harder  to  understand  and  analyze  than 
the  sequence  of  terms  a^,  and  it  is  difficult  to  determine  whether  series  converge  and  if  so 
to  what.  Sometimes  things  are  relatively  simple,  starting  with  the  following. 

THEOREM  11.2.3    If  XI      converges  then  lim      =  0. 

n— >oo 

Proof.     Since  Y'^n  converges,  lim  Sn  =  L  and  lim  Sn-i  =  L,  because  this  really  says 

n— >-oo  n— >-oo 

the  same  thing  but  "renumbers"  the  terms.  By  theorem  11.1.2, 

lim  [sn  —  Sn-i)  =  lim  Sn  —  lim  Sn-i  =  L  —  L  =  0. 

n— >oo  n— >oo  n—^oo 

But 

Sn  -  Sn-1  =  (ao  +  ai  +  a2  H  h  a^)  -  (ao  +  ai  +  02  H  h  a^-i)  =  a^, 

so  as  desired  lim  an  —  0-  ■ 

n— >oo 


This  theorem  presents  an  easy  divergence  test:  if  given  a  series  X]  <^n  the  limit  lim 

n^oo 

does  not  exist  or  has  a  value  other  than  zero,  the  series  diverges.  Note  well  that  the 
converse  is  not  true:  If  lim  an  =  0  then  the  series  does  not  necessarily  converge. 


00 

n 


EXAMPLE  11.2.4     Show  that  V   diverges. 

^  n  +  1 

n=l 

We  compute  the  limit: 


Tl 

lim   =  1^0. 

n^oo  n  -f  1 

Looking  at  the  first  few  terms  perhaps  makes  it  clear  that  the  series  has  no  chance  of 
converging: 

12     3  4 

2+3+4+5+- 
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will  just  get  larger  and  larger;  indeed,  after  a  bit  longer  the  series  starts  to  look  very  much 
like  +  1  +  1  +  1  +  1  +  and  of  course  if  we  add  up  enough  I's  we  can  make  the  sum 
as  large  as  we  desire.  □ 


CO  ^ 

EXAMPLE  11.2.5     Show  that  V  -  diverges. 


n 

n=l 


Here  the  theorem  does  not  apply:    lim  1/n  =  0,  so  it  looks  like  perhaps  the  series  con- 

verges.  Indeed,  if  you  have  the  fortitude  (or  the  software)  to  add  up  the  first  1000  terms 
you  will  find  that 

1000  ^ 

7.49, 

n=l 

SO  it  might  be  reasonable  to  speculate  that  the  series  converges  to  something  in  the  neigh- 
borhood of  10.  But  in  fact  the  partial  sums  do  go  to  infinity;  they  just  get  big  very,  very 
slowly.  Consider  the  following: 

1  111,111,11 

1111111  1111111  111 

l+2+3+4+5+6+7+8> 1+2+4+4+8+8+8+8= 1+2+2+2 

11  1  1111  11  1  1111 

l+2  +  3  +  '''  +  T6>l+2  +  4  +  4  +  8+---  +  8  +  T6  +  '"+16  =  l+2  +  2  +  2  +  2 

and  so  on.  By  swallowing  up  more  and  more  terms  we  can  always  manage  to  add  at  least 
another  1/2  to  the  sum,  and  by  adding  enough  of  these  we  can  make  the  partial  sums  as 
big  as  we  like.  In  fact,  it's  not  hard  to  see  from  this  pattern  that 


11  1  n 

2  +  3+---+^>l+2, 


SO  to  make  sure  the  sum  is  over  100,  for  example,  we'd  add  up  terms  until  we  get  to 
around  1/2^^^,  that  is,  about  4  ■  10^*^  terms.  This  series,  is  called  the  harmonic 

series.  □ 


Exercises  11.2. 

2n2  +  1 


1.  Explain  why  -—-tt — 7  diverges. 


n=l 


5 

2.  Explain  why  >  — —   diverges. 


21/"  +  14 

00  3 


3.  Explain  why        —  diverges. 


n 

n=l 
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4.  Compute 

n=0 

oo 

6.   Compute  ^ 


(-3)"  3" 
4" 


Tl  =  0 

OO  +  l 


n=0 

OG 


8.  Compute 


.  ^^  3  4 
5.    Compute  >  1  


n=0 


7.  Compute^—. 


n=0 

CXD 


9.  Compute^— ^. 


11.3    The  Integral  Test 

It  is  generally  quite  difficult,  often  impossible,  to  determine  the  value  of  a  series  exactly. 
In  many  cases  it  is  possible  at  least  to  determine  whether  or  not  the  series  converges,  and 
so  we  will  spend  most  of  our  time  on  this  problem. 

If  all  of  the  terms  in  a  series  are  non-negative,  then  clearly  the  sequence  of  partial 
sums  Sn  is  non-decreasing.  This  means  that  if  we  can  show  that  the  sequence  of  partial 
sums  is  bounded,  the  series  must  converge.  We  know  that  if  the  series  converges,  the 
terms  approach  zero,  but  this  does  not  mean  that  a„  >  an+i  for  every  n.  Many  useful 
and  interesting  series  do  have  this  property,  however,  and  they  are  among  the  easiest  to 
understand.  Let's  look  at  an  example. 


EXAMPLE  11.3.1 


oo  ^ 

Show  that  >  converges. 


n=l 


The  terms  are  positive  and  decreasing,  and  since  lim  =  0,  the  terms 

x—^oo 

approach  zero.  We  seek  an  upper  bound  for  all  the  partial  sums,  that  is,  we  want  to 
find  a  number  so  that  Sn  <  N  for  every  n.  The  upper  bound  is  provided  courtesy  of 
integration,  and  is  inherent  in  figure  11.3.1. 


Figure  11.3.1     Graph  of  y  =         with  rectangles. 


11.3    The  Integral  Test  269 


The  figure  shows  the  graph  oi  y  =  together  with  some  rectangles  that  lie  com- 
pletely below  the  curve  and  that  all  have  base  length  one.  Because  the  heights  of  the 
rectangles  are  determined  by  the  height  of  the  curve,  the  areas  of  the  rectangles  are  1/1^, 
1/2^,  l/3^  and  so  on — in  other  words,  exactly  the  terms  of  the  series.  The  partial  sum 
Sn  is  simply  the  sum  of  the  areas  of  the  first  n  rectangles.  Because  the  rectangles  all  lie 
between  the  curve  and  the  x-axis,  any  sum  of  rectangle  areas  is  less  than  the  corresponding 
area  under  the  curve,  and  so  of  course  any  sum  of  rectangle  areas  is  less  than  the  area 
under  the  entire  curve,  that  is,  all  the  way  to  infinity.  There  is  a  bit  of  trouble  at  the  left 
end,  where  there  is  an  asymptote,  but  we  can  work  around  that  easily.  Here  it  is: 

111  1         r  1  ,         /"°°  1  , 

1^     2^     3^  Ji   x'^  Ji  x'^ 

recalling  that  we  computed  this  improper  integral  in  section  9.7.  Since  the  sequence  of 
partial  sums  Sn  is  increasing  and  bounded  above  by  2,  we  know  that  lim      =  L  <  2,  and 

n— >oo 

SO  the  series  converges  to  some  number  less  than  2.  In  fact,  it  is  possible,  though  difficult, 
to  show  that  L  =  7r^/6  ~  1.6.  □ 

We  already  know  that  "^l/n  diverges.  What  goes  wrong  if  we  try  to  apply  this 
technique  to  it?  Here's  the  calculation: 

111  1         r  1  I  , 

Sn  =  zr  +  -  +  -^  h-<l+/    -dx  <1+        -dx  =  l  +  oo. 

12     3  n  Ji   X  Ji  X 

The  problem  is  that  the  improper  integral  doesn't  converge.  Note  well  that  this  does 
not  prove  that  diverges,  just  that  this  particular  calculation  fails  to  prove  that  it 

converges.  A  slight  modification,  however,  allows  us  to  prove  in  a  second  way  that  X]  1/^ 
diverges. 

EXAMPLE  11.3.2  Consider  a  slightly  altered  version  of  figure  11.3.1,  shown  in  fig- 
ure 11.3.2. 


1 


0 


A  =1^^ 

A  =  1/2 

A  =  1/3 

1 

0  1  2  3  4  5 


Figure  11.3.2     Graph  of  y  =  1/x  with  rectangles. 
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The  rectangles  this  time  are  above  the  curve,  that  is,  each  rectangle  completely  contains 
the  corresponding  area  under  the  curve.  This  means  that 


111        1         1  ,    ,  ri+i 

^n=T  +  -  +  --\  1-->  /       -iia;  =  lna; 

12      6  n     .h  X 


=  ln(n+l). 


As  n  gets  bigger,  ln(n  +  1)  goes  to  infinity,  so  the  sequence  of  partial  sums  Sn  must  also 
go  to  infinity,  so  the  harmonic  series  diverges.  □ 

The  important  fact  that  clinches  this  example  is  that 

f-n+l 


/■n-i-i 

lim   /       —  dx  =  oo, 

n->-ooJ-^  X 


which  we  can  rewrite  as 

'  1 


dx  =  oo. 

X 

So  these  two  examples  taken  together  indicate  that  we  can  prove  that  a  series  converges 
or  prove  that  it  diverges  with  a  single  calculation  of  an  improper  integral.  This  is  known 
as  the  integral  test,  which  we  state  as  a  theorem. 

THEOREM  11.3.3    Suppose  that  f{x)  >  0  and  is  decreasing  on  the  infinite  interval 

oo 

[k,oo)  (for  some  k  >  1)  and  that      —  f{n).  Then  the  series       a„  converges  if  and  only 


/oo 
f{x)  dx  converges. 


n=l 


The  two  examples  we  have  seen  are  called  p-series;  a  p-series  is  any  series  of  the  form 

El/n^.  If  p  <  0,  lim  1/n^  7^  0,  so  the  series  diverges.  For  positive  values  of  p  we  can 
n— >oo 

determine  precisely  which  series  converge. 

THEOREM  11.3.4    A  p-series  with  p  >  0  converges  if  and  only  if  p  >  1. 

Proof.    We  use  the  integral  test;  we  have  already  done  p  —  1,  so  assume  that  p  ^  1- 


i: 


°°  1  x^~'P 
dx  =  lim 


xP  D^oo  1  —  p 


D 


=  lim 


D-i-oo  1  —  p       1  —  P 


If  p  >  1  then  1  —  p  <  0  and  lim        ^  =  0,  so  the  integral  converges.  If  0  <  p  <  1  then 

D^oo 

1  —p  >  0  and  lim  D^~^  =  oo,  so  the  integral  diverges.  ■ 

r>— >-oo 
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oo  ^ 

EXAMPLE  11.3.5     Show  that  converges 


n=l 


We  could  of  course  use  the  integral  test,  but  now  that  we  have  the  theorem  we  may  simply 
note  that  this  is  a  p-series  with  p>  1.  □ 


oo  _ 

5 


EXAMPLE  11.3.6     Show  that  V  —  converges. 

n=l 

oo  oo 

We  know  that  if  converges  then  also  converges,  by  theorem  11.2.2.  Since 

oo  oo 

is  a  convergent  p-series,  converges  also.  □ 


n=l  71=1 


oo  _ 

5 


EXAMPLE  11.3.7    Show  that  diverges. 

^-^  Jn 

n=l 

oo  ^ 

This  also  follows  from  theorem  11.2.2:  Since  y    —j=  is  a  p-series  with  p  =  1/2  <  1,  it 


n=l 

oo  _ 

diverges,  and  so  does  ^  □ 


rges,  and  so  does  —=. 


Since  it  is  typically  difficult  to  compute  the  value  of  a  series  exactly,  a  good  approx- 
imation is  frequently  required.  In  a  real  sense,  a  good  approximation  is  only  as  good  as 
we  know  it  is,  that  is,  while  an  approximation  may  in  fact  be  good,  it  is  only  valuable  in 
practice  if  we  can  guarantee  its  accuracy  to  some  degree.  This  guarantee  is  usually  easy 
to  come  by  for  series  with  decreasing  positive  terms. 

EXAMPLE  11.3.8     Approximate  ^^1/n^  to  two  decimal  places. 

N 

Referring  to  figure  11.3.1,  if  we  approximate  the  sum  by       1/?^^,  the  error  we  make  is 


n=l 


the  total  area  of  the  remaining  rectangles,  all  of  which  lie  under  the  curve  l/x^  from  x  =  N 
out  to  infinity.  So  we  know  the  true  value  of  the  series  is  larger  than  the  approximation, 
and  no  bigger  than  the  approximation  plus  the  area  under  the  curve  from  to  infinity. 
Roughly,  then,  we  need  to  find      so  that 


oo  2 

^dx  <  1/100. 

AT  X 
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We  can  compute  the  integral: 


x2'^^  N-' 


'N 

SO  N  =  100  is  a  good  starting  point.  Adding  up  the  first  100  terms  gives  approximately 
1.634983900,  and  that  plus  1/100  is  1.644983900,  so  approximating  the  series  by  the  value 
halfway  between  these  will  be  at  most  1/200  =  0.005  in  error.  The  midpoint  is  1.639983900, 
but  while  this  is  correct  to  ±0.005,  we  can't  tell  if  the  correct  two-decimal  approximation 
is  1.63  or  1.64.  We  need  to  make  N  big  enough  to  reduce  the  guaranteed  error,  perhaps  to 
around  0.004  to  be  safe,  so  we  would  need  1/A^  ~  0.008,  or  N  —  125.  Now  the  sum  of  the 
first  125  terms  is  approximately  1.636965982,  and  that  plus  0.008  is  1.644965982  and  the 
point  halfway  between  them  is  1.640965982.  The  true  value  is  then  1.640965982±0.004,  and 
all  numbers  in  this  range  round  to  1.64,  so  1.64  is  correct  to  two  decimal  places.  We  have 
mentioned  that  the  true  value  of  this  series  can  be  shown  to  be  ^  1.644934068  which 
rounds  down  to  1.64  (just  barely)  and  is  indeed  below  the  upper  bound  of  1.644965982, 
again  just  barely.  Frequently  approximations  will  be  even  better  than  the  "guaranteed" 
accuracy,  but  not  always,  as  this  example  demonstrates.  □ 

Exercises  11.3. 

Determine  whether  each  series  converges  or  diverges. 

CXD  OO 

\  -     1  ^     \  -  n 

1-      >   77  ^  2.     >     —   ^ 

OO      ,  OO  . 

Emn  ,      v-^  1 

^  n?  +  1 

n=l  n=\ 

CXD  CXD 

5.  6. 

Z  J  gn  Z  ^  gTi 

CXD  ^  ^1 

-i —  ^  8.  y  — — ^  =^ 

n  =  2  n=2      ^  ' 

N      ,  Af 


I  \  1 

9.  Find  an  A'^  so  that  >  — r  is  between  >    —  and  >  — r  +  0.005. 

n  =  l  n  =  l  n=l 

10.  Find  an  A'^  so  that  >^  —  is  between        —  and   h  10^^.  : 


e"  ^ — '  e"  ^ — '  e'' 

71  =  0  71  =  0  77  =  0 


OO  N  N 

,    -n-    1       AT       1      \  ^  Inn  .   ,  \  -  Inn       ,  \  -  Inn 

11.  l<ind  an  A*  so  that  >  — —  is  between  >  — —  and  >  — - — h  0.005. 

n=X  n=\  77  =  1 


OO  ^  ^1  ^1 

12.  Find  an  N  so  that        —  — -  is  between        —  — ■  and        —  -r  +  0.005. 


_  n(lnn)-^  n(lnn)2  n(lnn 
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11.4    Alternating  Series 

Next  we  consider  series  with  both  positive  and  negative  terms,  but  in  a  regular  pattern: 
they  alternate,  as  in  the  alternating  harmonic  series  for  example: 


E 

n=l 


-1 


,n-l 


n 


1-11-1  111 


1 


+ 


In  this  series  the  sizes  of  the  terms  decrease,  that  is,  |a„|  forms  a  decreasing  sequence, 
but  this  is  not  required  in  an  alternating  series.  As  with  positive  term  series,  however, 
when  the  terms  do  have  decreasing  sizes  it  is  easier  to  analyze  the  series,  much  easier,  in 
fact,  than  positive  term  series.  Consider  pictorially  what  is  going  on  in  the  alternating 
harmonic  series,  shown  in  figure  11.4.1.  Because  the  sizes  of  the  terms  are  decreasing, 
the  partial  sums  si,  S3,  S5,  and  so  on,  form  a  decreasing  sequence  that  is  bounded  below 
by  S2,  so  this  sequence  must  converge.  Likewise,  the  partial  sums  S2,  S4,  sq,  and  so  on, 
form  an  increasing  sequence  that  is  bounded  above  by  si,  so  this  sequence  also  converges. 
Since  all  the  even  numbered  partial  sums  are  less  than  all  the  odd  numbered  ones,  and 
since  the  "jumps"  (that  is,  the  terms)  are  getting  smaller  and  smaller,  the  two  sequences 
must  converge  to  the  same  value,  meaning  the  entire  sequence  of  partial  sums  si,  S2,  S3, . . . 
converges  as  well. 


02 


S2 


04 


S4 


S6 


as 


S5 


S3 


1  =  si  =  ai 


as 

Figure  11.4.1     The  alternating  harmonic  series. 

There's  nothing  special  about  the  alternating  harmonic  series — the  same  argument 
works  for  any  alternating  sequence  with  decreasing  size  terms.  The  alternating  series  test 
is  worth  calling  a  theorem. 


THEOREM  11.4.1     Suppose  that  {a„}^^  ]^  is  a  non-increasing  sequence  of  positive 

00 

numbers  and  lim      =  0.  Then  the  alternating  series  N   (— l)"^~^a^  converges. 

71. — ^nn  t  ^ 


n=l 


Proof.  The  odd  numbered  partial  sums,  si,  S3,  S5,  and  so  on,  form  a  non-increasing 
sequence,  because  S2fc+3  =  ■S2fc+i  -  02/0+2  +  a2fc+3  <  S2fc+i,  since  a2fc+2  >  a2fc+3-  This 
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sequence  is  bounded  below  by  S2,  so  it  must  converge,  say  lim  S2k+i  =  L.  Likewise, 

fc— >oo 

the  partial  sums  S2,  S4,  S6)  cind  so  on,  form  a  non-decreasing  sequence  that  is  bounded 
above  by  si,  so  this  sequence  also  converges,  say  lim  S2k  =  M.  Since  lim  a„  =  0  and 

L  =  lim  S2fe+i  =  lim  {s2k  +  a2fc+i)  =  hm  S2A;  +  lim  a2fe+i  =  M  +  0  =  M, 

fc— >-oo  fc— >-oo  fc— >-oo  fc— >-oo 

so  L  =  M,  the  two  sequences  of  partial  sums  converge  to  the  same  limit,  and  this  means 
the  entire  sequence  of  partial  sums  also  converges  to  L.  ■ 

Another  useful  fact  is  implicit  in  this  discussion.  Suppose  that 

00 

n=l 

and  that  we  approximate  L  by  a  finite  part  of  this  sum,  say 

N 
n=l 

Because  the  terms  are  decreasing  in  size,  we  know  that  the  true  value  of  L  must  be  between 
this  approximation  and  the  next  one,  that  is,  between 

N  N+1 

J2{-ir-'an    and  J](-l)-ia„. 

n=l  n=l 

Depending  on  whether  N  is  odd  or  even,  the  second  will  be  larger  or  smaller  than  the  first. 

EXAMPLE  11.4.2    Approximate  the  alternating  harmonic  series  to  one  decimal  place. 

We  need  to  go  roughly  to  the  point  at  which  the  next  term  to  be  added  or  subtracted 
is  1/10.  Adding  up  the  first  nine  and  the  first  ten  terms  we  get  approximately  0.746  and 
0.646.  These  are  1/10  apart,  but  it  is  not  clear  how  the  correct  value  would  be  rounded.  It 
turns  out  that  we  are  able  to  settle  the  question  by  computing  the  sums  of  the  first  eleven 
and  twelve  terms,  which  give  0.737  and  0.653,  so  correct  to  one  place  the  value  is  0.7.  □ 

We  have  considered  alternating  series  with  first  index  1,  and  in  which  the  first  term  is 

positive,  but  a  little  thought  shows  this  is  not  crucial.  The  same  test  applies  to  any  similar 

00  00  00 

series,  such  as  ^(  — l)^'^a„,  ^(— l)"an,  ^  {  —  l)^an,  etc. 

n=0  n—1  n=17 
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Exercises  11.4- 

Determine  whether  the  following  series  converge  or  diverge. 

1.        — ^  2.  y^^iL 


3-  E(-ir^^  -  ^-  E(-ir^^ 

5.  Approximate  ^^f— 1)"^^— to  two  decimal  places 


=  1 


6.  Approximate        — 1)"  to  two  decimal  places. 


11.5    Comparison  Tests 

As  we  begin  to  compile  a  list  of  convergent  and  divergent  series,  new  ones  can  sometimes 
be  analyzed  by  comparing  them  to  ones  that  we  already  understand. 


EXAMPLE  11.5.1     Does  V       —  converge? 

^-^      In  n 

n=2 

The  obvious  first  approach,  based  on  what  we  know,  is  the  integral  test.  Unfortunately, 
we  can't  compute  the  required  antiderivative.  But  looking  at  the  series,  it  would  appear 
that  it  must  converge,  because  the  terms  we  are  adding  are  smaller  than  the  terms  of  a 
p-series,  that  is, 

1  1 

< 


when  n  >  3.  Since  adding  up  the  terms  doesn't  get  "too  big",  the  new  series  "should" 
also  converge.  Let's  make  this  more  precise. 

OO  ^  OO  ^ 

The  series  }    converges  if  and  only  if  }    converges — all  we've  done  is 


'n?  Inn  "n?  Inn 

n=2  n=3 


OO  ^ 

dropped  the  initial  term.  We  know  that  y         converges.  Looking  at  two  typical  partial 


sums: 


1  1  1  1111  1_ 

~l~  7  "t~         1      IT  4"  ■  ■  ■  ~f"      ~^  ^    „o  ~l~  To  ~l~      2         '  '  '  "t"  2 


3^  In  3     4-^  In  4     5^  In  5  n^lnn      3^     4^     5^  n 

Since  the  p-series  converges,  say  to  L,  and  since  the  terms  are  positive,  tn  <  L.  Since  the 
terms  of  the  new  series  are  positive,  the  Sn  form  an  increasing  sequence  and  Sn  <  <  L 
for  all  n.  Hence  the  sequence  {sn}  is  bounded  and  so  converges.  □ 
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Sometimes,  even  when  the  integral  test  apphes,  comparison  to  a  known  series  is  easier, 
so  it's  generally  a  good  idea  to  think  about  doing  a  comparison  before  doing  the  integral 
test. 


OO      I      .  I 

smn 


EXAMPLE  11.5.2    Does  - — ^  converge? 


n=2 


We  can't  apply  the  integral  test  here,  because  the  terms  of  this  series  are  not  decreasing. 
Just  as  in  the  previous  example,  however. 


sinni  1 
< 


9        —  9 


because  |  sinn|  <  1.  Once  again  the  partial  sums  are  non-decreasing  and  bounded  above 
by       1/n^  =  L,  so  the  new  series  converges.  □ 

Like  the  integral  test,  the  comparison  test  can  be  used  to  show  both  convergence  and 
divergence.  In  the  case  of  the  integral  test,  a  single  calculation  will  confirm  whichever  is 
the  case.  To  use  the  comparison  test  we  must  first  have  a  good  idea  as  to  convergence  or 
divergence  and  pick  the  sequence  for  comparison  accordingly. 


OO  ^ 


EXAMPLE  11.5.3    Does  >    -^^=  converge? 

Vn'^  —3 

n=2  * 

We  observe  that  the  —3  should  have  little  effect  compared  to  the  inside  the  square 
root,  and  therefore  guess  that  the  terms  are  enough  like  l/Vri^  =  1/n  that  the  series 
should  diverge.  We  attempt  to  show  this  by  comparison  to  the  harmonic  series.  We  note 
that 

1  11 


> 


-3  n 


so  that 


1  1  1111 

Sn  =      ,  H  ,  +  •  •  •  H  ,  >  1  [-•••H  =  tn, 

2     3  n 

where  tn  is  1  less  than  the  corresponding  partial  sum  of  the  harmonic  series  (because  we 
start  at  n  =  2  instead  of  n  =  1).  Since  lim  tn  —  oo,  lim  Sn  =  oo  as  well.  □ 

n— >-oo  n— >-oo 

So  the  general  approach  is  this:  If  you  believe  that  a  new  series  is  convergent,  attempt 
to  find  a  convergent  series  whose  terms  are  larger  than  the  terms  of  the  new  series;  if  you 
believe  that  a  new  series  is  divergent,  attempt  to  find  a  divergent  series  whose  terms  are 
smaller  than  the  terms  of  the  new  series. 


11.5    Comparison  Tests  2T7 


EXAMPLE  11.5.4    Does  V  , 


converge? 


Just  as  in  the  last  example,  we  guess  that  this  is  very  much  like  the  harmonic  series 
and  so  diverges.  Unfortunately, 

1  1 

< 


^/v?  +  3  n 

so  we  can't  compare  the  series  directly  to  the  harmonic  series.  A  little  thought  leads  us  to 


1  1  1 

> 


v/n2T3  +  3n2  2n' 

so  if  ^  l/(2n)  diverges  then  the  given  series  diverges.  But  since  ^  l/(2n)  =  (1/2)  ^  1/n, 
theorem  11.2.2  implies  that  it  does  indeed  diverge.  □ 

For  reference  we  summarize  the  comparison  test  in  a  theorem. 

THEOREM  11.5.5  Suppose  that  and  hn  are  non-negative  for  all  n  and  that  a„  <  6„ 
when  n  >  N,  for  some  A^. 

oo  oo 

If       bn  converges,  so  does  a„. 

n=0  n=0 

oo  oo 

If  diverges,  so  does  6^. 

n=0  n=0 


Exercises  11.5. 

Determine  whether  the  series  converge  or  diverge. 

^'    S  2n2  +  3n  +  5  ^  ^'  ^  2n2  +  3n  -  5 

1  3n  +  4 

■    ^  2n2  -  3n  -  5  ^  '  ^  2n2  +  3n  +  5 

n=l  71=1 


oo 


^              3n  +  4      _^  g          In  n 

^  2n2  +  3n  +  5  ^  ' 

7.    >    —  ^  8.  >  

oo  oo 

9.  Y-^^  10.  y  ^ 


2"  +  5"  ^  2"  +  3^^ 
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11.6    Absolute  Convergence 

Roughly  speaking  there  are  two  ways  for  a  series  to  converge:  As  in  the  case  of  ^ 
the  individual  terms  get  small  very  quickly,  so  that  the  sum  of  all  of  them  stays  finite,  or, 
as  in  the  case  of  ^^(— 1)"^"^/^,  the  terms  don't  get  small  fast  enough  {Yl,l/n  diverges), 
but  a  mixture  of  positive  and  negative  terms  provides  enough  cancellation  to  keep  the 
sum  finite.  You  might  guess  from  what  we've  seen  that  if  the  terms  get  small  fast  enough 
to  do  the  job,  then  whether  or  not  some  terms  are  negative  and  some  positive  the  series 
converges. 

oo  oo 

THEOREM  11.6.1    If  ^  |  a„|  converges,  then       On  converges. 

n=0  n=0 

oo 

Proof.     Note  that  0  <  an  +  |an|  <  2|art|  so  by  the  comparison  test  ^^(an+|a„|)  converges. 

n=0 

Now 

oo  oo  oo  oo 

n=0  n=0  n=0  n=0 

converges  by  theorem  11.2.2.  ■ 

So  given  a  series  with  both  positive  and  negative  terms,  you  should  first  ask 

whether  X]  converges.  This  may  be  an  easier  question  to  answer,  because  we  have 
tests  that  apply  specifically  to  terms  with  non-negative  terms.  If  ^  \an\  converges  then 
you  know  that  Yl'^n  converges  as  well.  If  ^  |a^|  diverges  then  it  still  may  be  true  that 
converges — you  will  have  to  do  more  work  to  decide  the  question.  Another  way  to 
think  of  this  result  is:  it  is  (potentially)  easier  for  ^  a„,  to  converge  than  for  ^  \an\  to 
converge,  because  the  latter  series  cannot  take  advantage  of  cancellation. 

If  ^  la^l  converges  we  say  that  ^  converges  absolutely;  to  say  that  ^  converges 
absolutely  is  to  say  that  any  cancellation  that  happens  to  come  along  is  not  really  needed, 
as  the  terms  already  get  small  so  fast  that  convergence  is  guaranteed  by  that  alone.  If 
^  On  converges  but  does  not,  we  say  that  X] '^n  converges  conditionally.  For 

oo  ^  oo  ^ 

example  y  (—1)""^^  converges  absolutely,  while  y  (  —  1)"^"^—  converges  conditionally. 


oo 

smn 


EXAMPLE  11.6.2     Does  V  — ^  converge? 
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OO      I     .  I 

Sinn 


\     sill  IL\ 

In  example  11.5.2  we  saw  that  >   ^ —  converges,  so  the  given  series  converges  abso- 

71  =  2 

lutely.  □ 


°°               3n  +  4 
EXAMPLE  11.6.3     Does  S^i-lY^ —   converge? 

T1  =  0 

Taking  the  absolute  value,  — w——          diverges  by  comparison  to       — so  if  the 

n=0  n=l 

series  converges  it  does  so  conditionally.  It  is  true  that  lim  {3n  +  A)/{2n^  +  3n  +  5)  =  0, 

n— >oo 

SO  to  apply  the  alternating  series  test  we  need  to  know  whether  the  terms  are  decreasing. 
If  we  let  f{x)  =  (3x  +  A)/{2x^  +  3x  +  5)  then  f{x)  =  -{6x^  +  16x  -  3)/{2x^  +  3x  +  5)^ 
and  it  is  not  hard  to  see  that  this  is  negative  for  x  >  1,  so  the  series  is  decreasing  and  by 
the  alternating  series  test  it  converges.  □ 

Exercises  11.6. 

Determine  whether  each  series  converges  absolutely,  converges  conditionally,  or  diverges. 

1.  ,   ^  2.  Vr-i)"-^- 


2n2  +  3n  +  5  '      2n2  +  3n  +  5 


CXD  ,  CXD  , 

linn  ^     V^/  ixn-iln^ 

3~ 


3-  ^-  B-i 

OO  ^  OO  „, 

5-  E(-iru^^  6.  5:(-ir 


Inn  '  2" +  5" 

ra=2  n=0 

OO  „^  OO 

7    V^/  g  ^^n-iarctann 

n=0  n=l 


11.7    The  Ratio  and  Root  Tests 

Does  the  series        ^  converge?  It  is  possible,  but  a  bit  unpleasant,  to  approach  this 

with  the  integral  test  or  the  comparison  test,  but  there  is  an  easier  way.  Consider  what 
happens  as  we  move  from  one  term  to  the  next  in  this  series: 

(n  +  1)^ 
■■■  +  —  +    ,  ^/  H  

5n  5n+l 

The  denominator  goes  up  by  a  factor  of  5,  =  5  ■  5",  but  the  numerator  goes  up  by 

much  less:  {n  +  1)^  =  +  hn^  +  lOn'^  +  lOn^  +  5n  +  1,  which  is  much  less  than  5n^  when 
n  is  large,  because  5n*  is  much  less  than  rC' .  So  we  might  guess  that  in  the  long  run  it 
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begins  to  look  as  if  each  term  is  1  /5  of  the  previous  term.  We  have  seen  series  that  behave 
hke  this: 

n=0 

a  geometric  series.  So  we  might  try  comparing  the  given  series  to  some  variation  of  this 
geometric  series.  This  is  possible,  but  a  bit  messy.  We  can  in  effect  do  the  same  thing, 
but  bypass  most  of  the  unpleasant  work. 
The  key  is  to  notice  that 

a„+i             {n  +  lfb^             (n  +  l)^l  1  1 

iim    =  iim   —          =  iim  =  !■-  =  -. 

n->-oo    an         n->-oo     5"'"'"'''     n°       n^oo  5  5  5 

This  is  really  just  what  we  noticed  above,  done  a  bit  more  officially:  in  the  long  run,  each 
term  is  one  fifth  of  the  previous  term.  Now  pick  some  number  between  1/5  and  1,  say  1/2. 
Because 

hm   =  -, 

n— >-oo    Uri  O 


then  when  n  is  big  enough,  say  n  >      for  some  N, 

  <  -    and    ttn+i  <  — • 

an       2  2 

So  Oiv+i  <  aAr/2,  aN+2  <  OAr+i/2  <  aAr/4,  gn+s  <  OAr+2/2  <  aAr+i/4  <  gn/S,  and  so 
on.  The  general  form  is  ajv+A;  <  a^/'^^  ■  So  if  we  look  at  the  series 

00 

OAT+fe  —  aN  -\-  OAT+I  +  OAr+2  +  OAT+S  +  •  •  •  +  OAT+fc  +  •  •  •  5 

fc=0 

its  terms  are  less  than  or  equal  to  the  terms  of  the  sequence 

00 

00  00 

So  by  the  comparison  test,  ^^ojv+fe  converges,  and  this  means  that       o„  converges, 

fe=0  n=0 

since  we've  just  added  the  fixed  number  ao  +  ai  +  •  •  •  +  aN-i- 

Under  what  circumstances  could  we  do  this?  What  was  crucial  was  that  the  limit  of 
a„+i/a„,  say  L,  was  less  than  1  so  that  we  could  pick  a  value  r  so  that  L  <  r  <  1.  The 
fact  that  L  <  r  (1/5  <  1/2  in  our  example)  means  that  we  can  compare  the  series  J^a^ 
to  ^r'^i  and  the  fact  that  r  <  1  guarantees  that  J^r"  converges.  That's  really  all  that  is 
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required  to  make  the  argument  work.  We  also  made  use  of  the  fact  that  the  terms  of  the 
series  were  positive;  in  general  we  simply  consider  the  absolute  values  of  the  terms  and  we 
end  up  testing  for  absolute  convergence. 

THEOREM  11.7.1     The  Ratio  Test      Suppose  that  lim  \an+i/a^\  =  L.  If  L  <  1 

the  series  ^  converges  absolutely,  if  L  >  1  the  series  diverges,  and  if  L  =  1  this  test 
gives  no  information. 

Proof.  The  example  above  essentially  proves  the  first  part  of  this,  if  we  simply  replace 
1/5  by  L  and  1/2  by  r.  Suppose  that  L  >  1,  and  pick  r  so  that  1  <  r  <  L.  Then  for 
n>  N,  for  some  N, 

\ttn+l\ 


\an\ 


>  r    and    |a„-|-i|  >  r|a„| 


This  implies  that  |aiv+fc|  >  r  \aN\,  but  since  r  >  1  this  means  that  lim  |ajv+A;|  7^  0,  which 

fc— >-oo 

means  also  that  lim      7^  0.  By  the  divergence  test,  the  series  diverges. 

n— >-oo 

To  see  that  we  get  no  information  when  L  =  1,  we  need  to  exhibit  two  series  with 
L  =  1,  one  that  converges  and  one  that  diverges.  It  is  easy  to  see  that  ^  1/n^  and  ^  1/n 
do  the  job.  ■ 

EXAMPLE  11.7.2    The  ratio  test  is  particularly  useful  for  series  involving  the  factorial 

00 

function.  Consider  ^^5'^/n!. 

n=0 

lim  — —  =  lim  —  =  lim  5 


rn-oo  (n  +  1)!  5"-       n^oo    5"    (n  +  1)!       n^oo    (n  +  1) 

Since  0  <  1,  the  series  converges.  □ 

A  similar  argument,  which  we  will  not  do,  justifies  a  similar  test  that  is  occasionally 
easier  to  apply. 

THEOREM  11.7.3    The  Root  Test     Suppose  that  lim  lan]^/""  =  L.  If  L  <  1  the 

n— >oo 

series  a„  converges  absolutely,  if  L  >  1  the  series  diverges,  and  if  L  =  1  this  test  gives 
no  information.  ■ 

The  proof  of  the  root  test  is  actually  easier  than  that  of  the  ratio  test,  and  is  a  good 
exercise. 


EXAMPLE  11.7.4    Analyze  V  — 


n=0 
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The  ratio  test  turns  out  to  be  a  bit  difficult  on  this  series  (try  it).  Using  the  root  test: 
hm     —         =  hm  - — —j—  =  nm  —  =  0. 

n— >-cxD  \  77,"'  /  n^cxD  (77,"')^/"'        n—>-oo  77, 

Since  0  <  1,  the  series  converges.  □ 

The  root  test  is  frequently  useful  when  n  appears  as  an  exponent  in  the  general  term 
of  the  series. 

Exercises  11.7. 

1.  Compute  lim  |a„-|-i/o„|  for  the  series  ^ 

2.  Compute  lim  |an+i/a„|  for  the  series  ^  1/n. 

n — >oo 

3.  Compute  lim  janl^^""  for  the  series  ^^/n^- 

n — >oo 

4.  Compute  lim  janl^^""  for  the  series  X^l/n. 

n — >OG 

Determine  whether  the  series  converge. 

5.  6. 

n=0  n=l 


OO 


.     ^  (n!)^ 


9.  Prove  theorem  11.7.3,  the  root  test. 


11.8    Power  Series 

Recall  that  we  were  able  to  analyze  all  geometric  series  "simultaneously"  to  discover  that 

n=0 

if  |x|  <  1,  and  that  the  series  diverges  when  \x\  >  1.  At  the  time,  we  thought  of  x  as  an 
unspecified  constant,  but  we  could  just  as  well  think  of  it  as  a  variable,  in  which  case  the 
series 


is  a  function,  namely,  the  function  k/{l  —  x),  as  long  as  |x|  <  1.  While  k/{l  —  x)  is  a  rea- 
sonably easy  function  to  deal  with,  the  more  complicated  ^  kx'^  does  have  its  attractions: 
it  appears  to  be  an  infinite  version  of  one  of  the  simplest  function  types — a  polynomial. 
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This  leads  naturally  to  the  questions:  Do  other  functions  have  representations  as  series? 
Is  there  an  advantage  to  viewing  them  in  this  way? 

The  geometric  series  has  a  special  feature  that  makes  it  unlike  a  typical  polynomial — 
the  coefficients  of  the  powers  of  x  are  the  same,  namely  k.  We  will  need  to  allow  more 
general  coefficients  if  we  are  to  get  anything  other  than  the  geometric  series. 

DEFINITION  11.8.1    A  power  series  has  the  form 

CO 

n=0 

with  the  understanding  that  a„  may  depend  on  n  but  not  on  x.  □ 


EXAMPLE  11.8.2  —  is  a  power  series.  We  can  investigate  convergence  using  the 

n 

n=l 

ratio  test: 


a;         n              ,  ,    n        ,  , 
lim  — —  =  lim  \x\  =  \x\. 

n->-oo  n+1  \x\'"'       n-i-oo  n+1 

Thus  when  |a;|  <  1  the  series  converges  and  when  >  1  it  diverges,  leaving  only  two  values 
in  doubt.  When  x  =  1  the  series  is  the  harmonic  series  and  diverges;  when  a;  =  —1  it  is  the 
alternating  harmonic  series  (actually  the  negative  of  the  usual  alternating  harmonic  series) 

and  converges.  Thus,  we  may  think  of       —  as  a  function  from  the  interval  [—1, 1)  to 

n=l 

the  real  numbers.  □ 

A  bit  of  thought  reveals  that  the  ratio  test  applied  to  a  power  series  will  always  have 
the  same  nice  form.  In  general,  we  will  compute 

lim  — I — j-j — I        =  lim  \x\— — —  =  \x\  lim  —  —  =  L,\x\, 

n— >-oo       [a^lpl"'  n— >-oo         \(ln\  n— >-cxd  \an\ 

assuming  that  lim  |aTj,+i|/|an|  exists.  Then  the  series  converges  if  L\x\  <  1,  that  is,  if 
<  1/L,  and  diverges  if  >  1/L.  Only  the  two  values  x  —  ±1/L  require  further 
investigation.  Thus  the  series  will  definitely  define  a  function  on  the  interval  {—1/L,  1/L), 
and  perhaps  will  extend  to  one  or  both  endpoints  as  well.  Two  special  cases  deserve 
mention:  if  L  =  0  the  limit  is  0  no  matter  what  value  x  takes,  so  the  series  converges  for 
all  X  and  the  function  is  defined  for  all  real  numbers.  If  L  =  oo,  then  no  matter  what 
value  X  takes  the  limit  is  infinite  and  the  series  converges  only  when  x  =  0.  The  value  1/L 
is  called  the  radius  of  convergence  of  the  series,  and  the  interval  on  which  the  series 
converges  is  the  interval  of  convergence. 
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Consider  again  the  geometric  series, 


oo 


n=0 

Whatever  benefits  there  might  be  in  using  the  series  form  of  this  function  are  only  avail- 
able to  us  when  x  is  between  —1  and  1.  Frequently  we  can  address  this  shortcoming  by 
modifying  the  power  series  slightly.  Consider  this  series: 

3n  3    )  ~l-^~l-x' 

n=0  n=0  ^  '  o 

because  this  is  just  a  geometric  series  with  x  replaced  by  {x  +  2)/3.  Multiplying  both  sides 
by  1/3  gives 

{x  +  2)"  _  1 


E 


3n+i 

n=0 

the  same  function  as  before.  For  what  values  of  x  does  this  series  converge?  Since  it  is  a 
geometric  series,  we  know  that  it  converges  when 

|a;  +  2|/3  <  1 
|a;  +  2|  <  3 
-3  <  x  +  2  <  3 
-5  <  a;  <  1. 


So  we  have  a  series  representation  for  1/(1  —  a;)  that  works  on  a  larger  interval  than  before, 
at  the  expense  of  a  somewhat  more  complicated  series.  The  endpoints  of  the  interval  of 
convergence  now  are  —5  and  1,  but  note  that  they  can  be  more  compactly  described  as 
—2  ±  3.  We  say  that  3  is  the  radius  of  convergence,  and  we  now  say  that  the  series  is 
centered  at  —2. 

DEFINITION  11.8.3    A  power  series  centered  at  a  has  the  form 

oo 

y^^anjx  -  a)'^, 

n=0 


with  the  understanding  that  a„  may  depend  on  n  but  not  on  x. 


□ 


11.9    Calculus  with  Power  Series  285 


Exercises  11.8. 

Find  the  radius  and  interval  of  convergence  for  each  series.  In  exercises  3  and  4,  do  not  attempt 
to  determine  whether  the  endpoints  are  in  the  interval  of  convergence. 


1.    ^nx"^  2.  J2 

OO  .  OO  . 

3.    >    — X    =^  4.    >    — (a;  —  2)  =^> 

n=l  n=l 

^  ^  n(n  +  1) 

11.9    Calculus  with  Power  Series 

Now  we  know  that  some  functions  can  be  expressed  as  power  series,  which  look  hke  infinite 
polynomials.  Since  calculus,  that  is,  computation  of  derivatives  and  antiderivatives,  is  easy 
for  polynomials,  the  obvious  question  is  whether  the  same  is  true  for  infinite  series.  The 
answer  is  yes: 


THEOREM  11.9.1     Suppose  the  power  series  f{x)  =  ^^a^(x  —  a)"^  has  radius  of 

n=0 

convergence  K.  Then 


n=0 

f{x)dx  =  C+J2^ 


nan (X  —  a 

n=0 

OO 


n=0 

and  these  two  series  have  radius  of  convergence  R  as  well. 
EXAMPLE  11.9.2     Starting  with  the  geometric  series: 

^  CXD 

—  =  Vx" 


1  —  X 

n=0 


/   rfx  =  -  In  11  -  x|  =  V  

J  l-x  +  1 

71  =  0 

OO  ^ 

lull  -x|  =  V  x"+^ 

'         '      ^  n+1 

n=0 


when  |x|  <  1.  The  series  does  not  converge  when  x  =  1  but  does  converge  when  x  =  —  1 
or  1  —  X  =  2.  The  interval  of  convergence  is  [—1, 1),  or  0  <  1  —  x  <  2,  so  we  can  use  the 
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series  to  represent  ln(x)  when  0  <  x  <  2.  For  example 

^  11 


ln(3/2)  =  ln(l--l/2)  =  5^(-l 


n  +  1  2^^+! 

n=0 

and  so 

,,,,1111        1         1         1  909 

In  3/2  ^  H  H  H  =   ^  0.406. 

^  '  ^      2     8     24     64     160     384     896  2240 

Because  this  is  an  alternating  series  with  decreasing  terms,  we  know  that  the  true  value 
is  between  909/2240  and  909/2240  -  1/2048  =  29053/71680  ^  .4053,  so  correct  to  two 
decimal  places  the  value  is  0.41. 

What  about  In (9/4)?  Since  9/4  is  larger  than  2  we  cannot  use  the  series  directly,  but 

ln(9/4)  =  ln((3/2)2)  =  21n(3/2)  ^  0.82, 

so  in  fact  we  get  a  lot  more  from  this  one  calculation  than  first  meets  the  eye.  To  estimate 
the  true  value  accurately  we  actually  need  to  be  a  bit  more  careful.  When  we  multiply  by 
two  we  know  that  the  true  value  is  between  0.8106  and  0.812,  so  rounded  to  two  decimal 
places  the  true  value  is  0.81.  □ 

Exercises  11.9. 

1.  Find  a  series  representation  for  In  2.  =^ 

2.  Find  a  power  series  representation  for  1/(1  —  x)^ .  =^> 

3.  Find  a  power  series  representation  for  2/(1  —  x)^ .  =^> 

4.  Find  a  power  series  representation  for  1/(1  —  x)^ .  What  is  the  radius  of  convergence?  =^> 

5.  Find  a  power  series  representation  for  /  ln(l  —  x)  dx. 


11.10    Taylor  Series 

We  have  seen  that  some  functions  can  be  represented  as  series,  which  may  give  valuable 
information  about  the  function.  So  far,  we  have  seen  only  those  examples  that  result  from 
manipulation  of  our  one  fundamental  example,  the  geometric  series.  We  would  like  to  start 
with  a  given  function  and  produce  a  series  to  represent  it,  if  possible. 

oo 

Suppose  that  f{x)  =       a^x"  on  some  interval  of  convergence.  Then  we  know  that 

n=0 

we  can  compute  derivatives  of  /  by  taking  derivatives  of  the  terms  of  the  series.  Let's  look 
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at  the  first  few  in  general: 

oo 

f'{x)  —       nanx'^~^  =  ai  +  2a2X  +  Sa^x^  +  4:a4X^  +  ■  ■  ■ 

n=l 

oo 

f"{x)  =  ^  n(n  -  1)0^2;'""^  =  2a2  +  3  •  2030;  +  4  •  Za^x'^  +  •  • 

n=2 

00 

f"'[x)  =  ^n{n-  l){n  -  2)anx'^-^  =  3  •  203  +  4  •  3  •  2040;  + 


n=3 


By  examining  these  it's  not  hard  to  discern  the  general  pattern.  The  A;th  derivative  must 
be 


fi^){x)  =  ^  n(n  -  l)(n  -  2)  •  •  •  (n  -  /c  +  l)a, 

n=fe 

=  k{k  -  l){k  -  2)  ■  ■  ■  (2)(l)afc  +  (/c  +  l)(/e)  •  ■  •  {2)ak+ix  + 
+  (/c  +  2)(/c +  !)•••  (3)afc+2X^  +  --- 
We  can  shrink  this  quite  a  bit  by  using  factorial  notation: 

/W(a;)  =  ^  ^L^anx"-'^  =  k\ak  +  {k+  l)\ak+ix  +  ^^y^afe+aa;^  +  •  •  • 

Now  substitute  a;  =  0: 

and  solve  for  a^: 

^^  =  ^^- 

Note  the  special  case,  obtained  from  the  series  for  /  itself,  that  gives  /(O)  =  ao- 

So  if  a  function  /  can  be  represented  by  a  series,  we  know  just  what  series  it  is.  Given 
a  function  /,  the  series 


E 

is  called  the  Maclaurin  series  for  /. 


00  .rn)(Q^ 


n! 

n=0 
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EXAMPLE  11.10.1  Find  the  Maclaurin  series  for  f{x)  =  1/(1  -  x).  We  need  to 
compute  the  derivatives  of  /  (and  hope  to  spot  a  pattern). 


/(^) 

=  (1- 

=  (1- 

fix) 

=  2(1- 

f"'{x) 

=  6(1  - 

=  4!(1 

So 


and  the  Maclaurin  series  is 


/^")(x)  =n!(l-x)-^-^ 
/(")(0)  n!(l-0)-"-i 


n=0  n=0 

the  geometric  series.  □ 

A  warning  is  in  order  here.  Given  a  function  /  we  may  be  able  to  compute  the 
Maclaurin  series,  but  that  does  not  mean  we  have  found  a  series  representation  for  /.  We 
still  need  to  know  where  the  series  converges,  and  if,  where  it  converges,  it  converges  to 
f{x).  While  for  most  commonly  encountered  functions  the  Maclaurin  series  does  indeed 
converge  to  /  on  some  interval,  this  is  not  true  of  all  functions,  so  care  is  required. 

As  a  practical  matter,  if  we  are  interested  in  using  a  series  to  approximate  a  function, 
we  will  need  some  finite  number  of  terms  of  the  series.  Even  for  functions  with  messy 
derivatives  we  can  compute  these  using  computer  software  like  Sage.  If  we  want  to  know  the 
whole  series,  that  is,  a  typical  term  in  the  series,  we  need  a  function  whose  derivatives  fall 
into  a  pattern  that  we  can  discern.  A  few  of  the  most  important  functions  are  fortunately 
very  easy. 

EXAMPLE  11.10.2     Find  the  Maclaurin  series  for  sinx. 

The  derivatives  are  quite  easy:  f'{x)  =  cosx,  f"{x)  =  —sinx,  f"'{x)  =  —cosx, 
f^'^\x)  —  sinx,  and  then  the  pattern  repeats.  We  want  to  know  the  derivatives  at  zero:  1, 
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0,  —1,  0,  1,  0,  —1,  0,. . . ,  and  so  the  Maclaurin  series  is 


X  r  +  -r  =>    (-1,  , 

3!      5!  '    2n  +  l 

n=0  ^ 

We  should  always  determine  the  radius  of  convergence: 


|^|2n+3    (2n  +  l)!         ,  IxP 

lim  —  - — - — =  lim  —  ^  =  0, 

n^oo(2n  +  3)!  n^oo  (27i  +  3)(2n  +  2) 


so  the  series  converges  for  every  x.  Since  it  turns  out  that  this  series  does  indeed  converge 
to  sin  a;  everywhere,  we  have  a  series  representation  for  sin  a:  for  every  x.  Here  is  an 
interactive  plot  of  the  sine  and  some  of  its  series  approximations.  □ 

Sometimes  the  formula  for  the  nth  derivative  of  a  function  /  is  difficult  to  discover, 
but  a  combination  of  a  known  Maclaurin  series  and  some  algebraic  manipulation  leads 
easily  to  the  Maclaurin  series  for  /. 

EXAMPLE  11.10.3    Find  the  Maclaurin  series  for  a;sin(— x). 

To  get  from  sin  a;  to  x  sin(— a;)  we  substitute  —x  for  x  and  then  multiply  by  x.  We  can 
do  the  same  thing  to  the  series  for  sinx: 

n=0  ^  '  n=0  ^  '         n=0  ^  ' 

□ 

As  we  have  seen,  a  general  power  series  can  be  centered  at  a  point  other  than  zero, 
and  the  method  that  produces  the  Maclaurin  series  can  also  produce  such  series. 

EXAMPLE  11.10.4    Find  a  series  centered  at  -2  for  1/(1  -  x). 

CO 

If  the  series  is       a-nix  +  2)"^  then  looking  at  the  /cth  derivative: 

n=0 

fc!(l-x)-'=-i  =  5^^^^^a.(x  +  2r-'= 

and  substituting  a:  =  — 2  we  get  ^13"^^"^  =  /c!afc  and      =  3"^^"^  =  1/3'^"'"^,  so  the  series  is 

A  (x  +  2)" 

n=0 

We've  already  seen  this,  on  page  284.  □ 
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Such  a  series  is  called  the  Taylor  series  for  the  function,  and  the  general  term  has 
the  form 


n! 


fa; -a)". 


A  Maclaurin  series  is  simply  a  Taylor  series  with  a  =  0. 
Exercises  11.10. 

For  each  function,  find  the  Maclaurin  series  or  Taylor  series  centered  at  a,  and  the  radius  of 
convergence. 

1.  cosx  =^ 

2.  =^ 

3.  l/x,  a  =  5  ^ 

4.  Inx,  a  =  f  =^> 

5.  \nx,  a  =  2  =^> 

6.  f /x^,  a  =  f 


7.  =^ 

8.  Find  the  first  four  terms  of  the  Maclaurin  series  for  tana;  (up  to  and  including  the  term). 

=^ 

9.  Use  a  combination  of  Maclaurin  series  and  algebraic  manipulation  to  find  a  series  centered 
at  zero  for  a;cos(x^). 

10.  Use  a  combination  of  Maclaurin  series  and  algebraic  manipulation  to  find  a  series  centered 
at  zero  for  xe^^ .  =^ 


11.11    Taylor's  Theorem 

One  of  the  most  important  uses  of  infinite  series  is  the  potential  for  using  an  initial  portion 
of  the  series  for  /  to  approximate  /.  We  have  seen,  for  example,  that  when  we  add  up  the 
first  n  terms  of  an  alternating  series  with  decreasing  terms  that  the  difference  between  this 
and  the  true  value  is  at  most  the  size  of  the  next  term.  A  similar  result  is  true  of  many 
Taylor  series. 

THEOREM  11.11.1  Suppose  that  /  is  defined  on  some  open  interval  /  around  a  and 
suppose  f^^~^^\x)  exists  on  this  interval.  Then  for  each  x  7^  a  in  /  there  is  a  value  z 
between  x  and  a  so  that 

n=0        ■  V      '  y- 
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Proof.  The  proof  requires  some  cleverness  to  set  up,  but  then  the  details  are  quite 
elementary.  We  want  to  define  a  function  F{t).  Start  with  the  equation 

F{t)  =      ^—r^    -     +  B{x  -  t)^+\ 

n=0  ^' 

Here  we  have  replaced  a  by  i  in  the  first  +  1  terms  of  the  Taylor  series,  and  added  a 
carefully  chosen  term  on  the  end,  with  B  to  be  determined.  Note  that  we  are  temporarily 
keeping  x  fixed,  so  the  only  variable  in  this  equation  is  t,  and  we  will  be  interested  only  in 
t  between  a  and  x.  Now  substitute  t  =  a: 

F{a)  =       ^^^^  {x  -  ar  +  B{x  -  a)^+^ 

n=0 

Set  this  equal  to  f{x): 

f{x)  =  y  ^—p-  {x  -  aY  +  B{x  -  a)^+i. 

n=0 

Since  x  ^  a,  we  can  solve  this  for  B,  which  is  a  "constant" — it  depends  on  x  and  a  but 
those  are  temporarily  fixed.  Now  we  have  defined  a  function  F{t)  with  the  property  that 
F(a)  =  f{x).  Consider  also  F{x):  all  terms  with  a  positive  power  of  [x  —  t)  become  zero 
when  we  substitute  x  for  t,  so  we  are  left  with  F{x)  =  f^^\x)/0\  =  f{x).  So  F{t)  is  a 
function  with  the  same  value  on  the  endpoints  of  the  interval  [a,x].  By  RoUe's  theorem 
(6.5.1),  we  know  that  there  is  a  value  z  G  (a,  x)  such  that  F'{z)  =  0.  Let's  look  at  F'(t). 
Each  term  in  F(t),  except  the  first  term  and  the  extra  term  involving  B,  is  a  product,  so 
to  take  the  derivative  we  use  the  product  rule  on  each  of  these  terms.  It  will  help  to  write 
out  the  first  few  terms  of  the  definition: 
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Now  take  the  derivative: 


tn- 

-1)+ 

1! 

t)\- 

-1)+ 

2! 

tn- 

-1)+ 

3! 

(Ar-l)!^"" 

-tr-\- 

-1)+  ^ 

+  B{N  +  l){x-tf{-l). 
Now  most  of  the  terms  in  this  expression  cancel  out,  leaving  just 

F'{t)  =  ^-^^^^^{x  -       +  B{N  +  l){x  -  tf{-l). 


At  some  z,  F'{z)  =  0  so 


0  =  l^!^^^{x  -  z)^  +  B{N  +  l){x  -  ^)^(-l) 


B{N  +  l){x  -        =  '    ^,y{x  -  z) 


Now  we  can  write 


N 


(iV+1)!  • 


n=0 


Recalling  that  F{a)  =  f{x)  we  get 

/(^)  =  - +  fiv  +  i)i  ' 

n=0         ■  V       '  /• 

which  is  what  we  wanted  to  show.  ■ 

It  may  not  be  immediately  obvious  that  this  is  particularly  useful;  let's  look  at  some 
examples. 


11.11    Taylor's  Theorem  293 


EXAMPLE  11.11.2    Find  a  polynomial  approximation  for  sinx  accurate  to  ±0.005. 
From  Taylor's  theorem: 


^  c(n) 


smx 


n=0 


ni 


(N  +  iy. 


What  can  we  say  about  the  size  of  the  term 


(N  +  l) 


-{x  —  a) 


Every  derivative  of  sinx  is  ±sinx  or  icosx,  so  \  f^^~^^\z)\  <  1.  The  factor  {x  —  a)^~^^  is 
a  bit  more  difficult,  since  x  —  a  could  be  quite  large.  Let's  pick  a  =  0  and  |x|  <  7r/2;  if  we 
can  compute  sinx  for  x  G  [— 7r/2, 7r/2],  we  can  of  course  compute  sinx  for  all  x. 
We  need  to  pick      so  that 


X 


N+l 


(N  +  iy. 


<  0.005. 


Since  we  have  limited  x  to  [— 7r/2, 7r/2], 


X 


(N  +  iy. 


)N+1 


< 


(N  +  iy. 


The  quantity  on  the  right  decreases  with  increasing  A^,  so  all  we  need  to  do  is  find  an 
so  that 

2N+1 

<  0.005. 


(N  +  iy 

A  little  trial  and  error  shows  that  N  =  8  works,  and  in  fact  2^/9!  <  0.0015,  so 


smx 


f/i!M,~±  0.0015 


n=0 


ni 


X  h 


X 


±0.0015. 


6      120  5040 

Figure  11.11.1  shows  the  graphs  of  sinx  and  and  the  approximation  on  [0,  37r/2].  As  x 
gets  larger,  the  approximation  heads  to  negative  infinity  very  quickly,  since  it  is  essentially 
acting  like  —  x^.  □ 
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1 


Figure  11.11.1     sinx  and  a  polynomial  approximation.  (AP) 
We  can  extract  a  bit  more  information  from  this  example.  If  we  do  not  limit  the  value 


of  a;,  we  still  have 


/(iv+i)(^) 


so  that  sinx  is  represented  by 


(N  +  l) 


^  c(n) 


-X 


N+l 


< 


X 


N+l 


(N+l) 


n=0 


ni 


X 


N+l 


{N  +  l)l 


If  we  can  show  that 


for  each  x  then 


lim 

N^oo 


X 


N+l 


smx 


ni 


(N  +  iy. 

oo 


X 


2n+l 


{2n  +  l] 


n=0  n=0 

that  is,  the  sine  function  is  actually  equal  to  its  Maclaurin  series  for  all  x.  How  can  we 
prove  that  the  limit  is  zero?  Suppose  that  is  larger  than  |a;|,  and  let  M  be  the  largest 
integer  less  than      (if  M  =  0  the  following  is  even  easier).  Then 


\x 


N+l\ 


\X\ 


\X\ 


2  1 


< 


\x\ 


N  +  l 


■  1  ■  ■ 

M 


M  M-1 


2  1 


The  quantity  \x\    /Ml  is  a  constant,  so 


lim 


M 


N^oo  N  +1  Ml 
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and  by  the  Squeeze  Theorem  (11.1.3) 


X 

(N  +  iy. 


as  desired.  EssentiaUy  the  same  argument  works  for  cos  a;  and  e^;  unfortunately,  it  is  more 
difficult  to  show  that  most  functions  are  equal  to  their  Maclaurin  series. 

EXAMPLE  11.11.3  Find  a  polynomial  approximation  for  near  x  =  2  accurate  to 
±0.005. 

From  Taylor's  theorem: 

N 


e 

n=0 


since  f^^\x)  =  for  all  n.  We  are  interested  in  x  near  2,  and  we  need  to  keep  — 2)"^^^! 
in  check,  so  we  may  as  well  specify  that  |a;  —  2|  <  1,  so  x  G  [1,  3].  Also 


(iV  +  1)! 


e3 

< 


(iV  +  1)!' 


so  we  need  to  find  an  that  makes  e^/{N  +  1)!  <  0.005.  This  time  N  =  5  makes 
e^/{N  +  1)!  <  0.0015,  so  the  approximating  polynomial  is 

2  2  2  2 

=     +  e\x  -  2)  +  —(x  -2f  +  -(x~  2f  +  — (x  -  2)^  +  — (x  -  2)^  ±  0.0015. 
V  ^       6^        ^       24^        ^       120^  ^ 

This  presents  an  additional  problem  for  approximation,  since  we  also  need  to  approximate 
e^,  and  any  approximation  we  use  will  increase  the  error,  but  we  will  not  pursue  this 
complication.  □ 

Note  well  that  in  these  examples  we  found  polynomials  of  a  certain  accuracy  only  on 
a  small  interval,  even  though  the  series  for  sinx  and  converge  for  all  x;  this  is  typical. 
To  get  the  same  accuracy  on  a  larger  interval  would  require  more  terms. 

Exercises  11.11. 

1.  Find  a  polynomial  approximation  for  cosx  on  [0,7r],  accurate  to  ±10^^  =^> 

2.  How  many  terms  of  the  series  for  Inx  centered  at  1  are  required  so  that  the  guaranteed  error 
on  [1/2,3/2]  is  at  most  10"^?  What  if  the  interval  is  instead  [1,3/2]?  =^ 

3.  Find  the  first  three  nonzero  terms  in  the  Taylor  series  for  tan  a;  on  [— 7r/4, 7r/4],  and  compute 
the  guaranteed  error  term  as  given  by  Taylor's  theorem.  (You  may  want  to  use  Sage  or  a 
similar  aid.) 
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4.  Show  that  cosx  is  equal  to  its  Taylor  series  for  all  x  by  showing  that  the  limit  of  the  error 
term  is  zero  as  A''  approaches  infinity. 

5.  Show  that      is  equal  to  its  Taylor  series  for  all  x  by  showing  that  the  limit  of  the  error  term 
is  zero  as  A'^  approaches  infinity. 

11.12    Additional  exercises 

These  problems  require  the  techniques  of  this  chapter,  and  are  in  no  particular  order.  Some 
problems  may  be  done  in  more  than  one  way. 

Determine  whether  the  series  converges. 

oo 

1-  E 


oo 

n 


+  4 

1  1  1  1 


3-  E 


oo 

n 


{v?  +  4)2 


n=0 

oo 

n=0 


..      3     5      7  9 

^-4  +  8-12  +  16+---^ 


oo  ^ 


7_  (n)  ^ 


n=0 

oo 


E 


Tl  =  0 
OO 


n 


9    V  -  

^„  1  ■  3  •  5  ■  •  •  (2n  -  1) 


n=0 
oo 


10. 


n  =  l 


_         1  2  3  4 

11.  ^  \  \  h 

2-3-4     3-4-5     4-5-6  5-6-7 

12.  ^il3-5---(2n-l)  _ 

n=l 
oo 

13.  E 

14.  t  ^-'y 


(2n)! 
6" 


n! 

n=0 

\n-l 


Jn 
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15.  j: 


n  =  l 


•     ^  22  ^  (2  •  4)2  +  (2  •  4  •  6)2  ^  (2  •  4  •  6  •  8)2  ^  ' 


17.  ^sin(l/r 


n  =  l 


Find  the  interval  and  radius  of  convergence;  you  need  not  check  the  endpoints  of  the  intervals. 

18.  y  ^-x-  ^ 

n=0 


"•El 


X 


n=0 

oo 


3^' 


20.  y 


n  =  l 


X 

n3' 


Ix^      1  •  3         1  •  3  •  5 

21.  x-\  \  \  h 

2  3      2-4  5      2-4-6  7 


n!  ^ 

X 


22.  y 

n  =  l 

24. 


n2 

n  =  l 


n! 

n=0 

Find  a  series  for  each  function,  using  the  formula  for  Maclaurin  series  and  algebraic  manipulation 
as  appropriate. 

25.  2"=  ^ 

26.  ln(l  +  x)  ^ 

27.  ln'^  +  ^^ 


1  —  X 

28.  Vl  +  a;  ^ 

29.  ' 


1  +  X2 

30.  arctan(x)  =^ 

31.  Use  the  answer  to  the  previous  problem  to  discover  a  series  for  a  well-known  mathematical 
constant.  =^ 


Three 


Dimensions 


12.1    The  Coordinate 


System 


So  far  we  have  been  investigating  functions  of  the  form  y  =  f{x),  with  one  independent  and 
one  dependent  variable.  Such  functions  can  be  represented  in  two  dimensions,  using  two 
numerical  axes  that  allow  us  to  identify  every  point  in  the  plane  with  two  numbers.  We  now 
want  to  talk  about  three-dimensional  space;  to  identify  every  point  in  three  dimensions 
we  require  three  numerical  values.  The  obvious  way  to  make  this  association  is  to  add 
one  new  axis,  perpendicular  to  the  x  and  y  axes  we  already  understand.  We  could,  for 
example,  add  a  third  axis,  the  z  axis,  with  the  positive  z  axis  coming  straight  out  of  the 
page,  and  the  negative  z  axis  going  out  the  back  of  the  page.  This  is  difficult  to  work  with 
on  a  printed  page,  so  more  often  we  draw  a  view  of  the  three  axes  from  an  angle: 


y 


X 


z 
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You  must  then  imagine  that  the  z  axis  is  perpendicular  to  the  other  two.  Just  as  we  have 
investigated  functions  of  the  form  y  =  j{x)  in  two  dimensions,  we  will  investigate  three 
dimensions  largely  by  considering  functions;  now  the  functions  will  (typically)  have  the 
form  z  =  f{x,y).  Because  we  are  used  to  having  the  result  of  a  function  graphed  in  the 
vertical  direction,  it  is  somewhat  easier  to  maintain  that  convention  in  three  dimensions. 
To  accomplish  this,  we  normally  rotate  the  axes  so  that  z  points  up;  the  result  is  then: 


X 


Note  that  if  you  imagine  looking  down  from  above,  along  the  z  axis,  the  positive  z  axis 
will  come  straight  toward  you,  the  positive  y  axis  will  point  up,  and  the  positive  x  axis 
will  point  to  your  right,  as  usual.  Any  point  in  space  is  identified  by  providing  the  three 
coordinates  of  the  point,  as  shown;  naturally,  we  list  the  coordinates  in  the  order  (x,  y,  z). 
One  useful  way  to  think  of  this  is  to  use  the  x  and  y  coordinates  to  identify  a  point  in  the 
x-y  plane,  then  move  straight  up  (or  down)  a  distance  given  by  the  z  coordinate. 

It  is  now  fairly  simple  to  understand  some  "shapes"  in  three  dimensions  that  corre- 
spond to  simple  conditions  on  the  coordinates.  In  two  dimensions  the  equation  x  =  1 
describes  the  vertical  line  through  (1,0).  In  three  dimensions,  it  still  describes  all  points 
with  x-coordinate  1,  but  this  is  now  a  plane,  as  in  figure  12.1.1. 

Recall  the  very  useful  distance  formula  in  two  dimensions:  the  distance  between  points 
(xi,  2/1 )  and  (x2, 2/2)  is  a/ (xi  —  ^2)^  +  {yi  —  2/2)^;  this  comes  directly  from  the  Pythagorean 
theorem.  What  is  the  distance  between  two  points  (xi,yi,2i)  and  (x2,y2,-22)  in  three 
dimensions?  Geometrically,  we  want  the  length  of  the  long  diagonal  labeled  c  in  the 
"box"  in  figure  12.1.2.  Since  a,  6,  c  form  a  right  triangle,  +  6^  =  c^.  b  is  the  vertical 
distance  between  (xi,yi,2i)  and  (x2, 2/2,  2:2),  so  6  =  \zi  —  Z2\.  The  length  a  runs  parallel 
to  the  x-y  plane,  so  it  is  simply  the  distance  between  (xi,2/i)  and  (x2, 2/2)5  that  is,  = 


1/2  J 


Now  we  see  that      =  (xi  —  X2)^  +  (2/1  —  2/2)^  +  {zi  —  ^2)^  and 


(xi  -  X2)^  +  (2/1 
c  =  a/ (xi  -  X2)2  +  (2/1  -  2/2)^  +  (zi  -  Z2Y. 

It  is  sometimes  useful  to  give  names  to  points,  for  example  we  might  let  Pi  = 
(xi,2/i,^i),  or  more  concisely  we  might  refer  to  the  point  P\{x\,y\,z\),  and  subsequently 
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1.0 


Figure  12.1.1     The  plane  x  =  1. 

use  just  Pi.  Distance  between  two  points  in  either  two  or  three  dimensions  is  some- 
times denoted  by  d,  so  for  example  the  formula  for  the  distance  between  Pi{xi,  yi,zi)  and 
-^2(2^2, 2/2,  ^2)  might  be  expressed  as 

d{Pi,P2)  =  v/(xi -X2)2  +  (yi- 2/2)2  +  (^1-^2)2. 


z 


y 


X 

Figure  12.1.2     Distance  in  three  dimensions. 

In  two  dimensions,  the  distance  formula  immediately  gives  us  the  equation  of  a  circle: 
the  circle  of  radius  r  and  center  at  {h,k)  consists  of  all  points  at  distance  r  from 
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(/i,  k),  so  the  equation  is  r  =  a/(x  —  /i)^  +  {y  —  k)'^  or  =  (x  —  /i)^  +  (2/  —  ^)^-  Now  we 
can  get  the  similar  equation  =  (x  —  h)"^  +  (y  —  k)"^  +  (z  —  Z)^,  which  describes  aU  points 
(x,  y,  z)  at  distance  r  from  (/i,  /c,  /),  namely,  the  sphere  with  radius  r  and  center  (/i,  /c,  /). 

Exercises  12.1. 

1.  Sketch  the  location  of  the  points  (1,1,0),  (2,3,-1),  and  (—1,2,3)  on  a  single  set  of  axes. 

2.  Describe  geometrically  the  set  of  points  (x,  y,  z)  that  satisfy  2  =  4. 

3.  Describe  geometrically  the  set  of  points  (x,  y,  z)  that  satisfy  y  =  —3. 

4.  Describe  geometrically  the  set  of  points  (x,  y,  z)  that  satisfy  a;  +  y  =  2. 

5.  The  equation  x  +  y  +  z  =  1  describes  some  collection  of  points  in  M^.  Describe  and  sketch 
the  points  that  satisfy  x  +  y  -\-  z  =  1  and  are  in  the  x-y  plane,  in  the  x-z  plane,  and  in  the 
y-z  plane. 

6.  Find  the  lengths  of  the  sides  of  the  triangle  with  vertices  (1,  0, 1),  (2,  2,  —1),  and  (—3,  2,  —2). 
=^ 

7.  Find  the  lengths  of  the  sides  of  the  triangle  with  vertices  (2,2,3),  (8,6,5),  and  (  —  1,0,2). 
Why  do  the  results  tell  you  that  this  isn't  really  a  triangle?  =r- 

8.  Find  an  equation  of  the  sphere  with  center  at  (1, 1, 1)  and  radius  2. 

9.  Find  an  equation  of  the  sphere  with  center  at  (2,  —1,  3)  and  radius  5.  =r- 

10.  Find  an  equation  of  the  sphere  with  center  (3,  —2, 1)  and  that  goes  through  the  point  (4,  2,  5). 

11.  Find  an  equation  of  the  sphere  with  center  at  (2, 1,  —1)  and  radius  4.  Find  an  equation  for 
the  intersection  of  this  sphere  with  the  y-z  plane;  describe  this  intersection  geometrically.  =f- 

12.  Consider  the  sphere  of  radius  5  centered  at  (2,3,4).  What  is  the  intersection  of  this  sphere 
with  each  of  the  coordinate  planes? 

13.  Show  that  for  all  values  of  6  and  4>,  the  point  (a  sin  </>  cos  0,  a  sine/)  sin  0,  a  cose/))  lies  on  the 
sphere  given  by  x"^  +  y"^  +  z"^  =  a' . 

14.  Prove  that  the  midpoint  of  the  line  segment  connecting  (xi,yi,zi)  to  {x2,y2,Z2)  is  at 
fxi  +  X2  yi  +  y2  zi  +  Z2\ 

V     2     '      2     '      2  /■ 

15.  Any  three  points  Pi{xi,  yi,  zi),  P2{x2,y2,  ^2),  Psixs,  ys,  Z3),  lie  in  a  plane  and  form  a  triangle. 
The  triangle  inequality  says  that  d{Pi,P3)  <  d{Pi,P2)  +  d(P2,P3).  Prove  the  triangle 
inequality  using  either  algebra  (messy)  or  the  law  of  cosines  (less  messy). 

16.  Is  it  possible  for  a  plane  to  intersect  a  sphere  in  exactly  two  points?  Exactly  one  point? 
Explain. 

12.2  Vectors 

A  vector  is  a  quantity  consisting  of  a  non-negative  magnitude  and  a  direction.  We  could 
represent  a  vector  in  two  dimensions  as  (m,^),  where  m  is  the  magnitude  and  9  is  the 
direction,  measured  as  an  angle  from  some  agreed  upon  direction.  For  example,  we  might 
think  of  the  vector  (5,45°)  as  representing  "5  km  toward  the  northeast";  that  is,  this 
vector  might  be  a  displacement  vector,  indicating,  say,  that  your  grandfather  walked 
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5  kilometers  toward  the  northeast  to  school  in  the  snow.  On  the  other  hand,  the  same 
vector  could  represent  a  velocity,  indicating  that  your  grandfather  walked  at  5  km/hr 
toward  the  northeast.  What  the  vector  does  not  indicate  is  where  this  walk  occurred:  a 
vector  represents  a  magnitude  and  a  direction,  but  not  a  location.  Pictorially  it  is  useful 
to  represent  a  vector  as  an  arrow;  the  direction  of  the  vector,  naturally,  is  the  direction  in 
which  the  arrow  points;  the  magnitude  of  the  vector  is  reflected  in  the  length  of  the  arrow. 

It  turns  out  that  many,  many  quantities  behave  as  vectors,  e.g.,  displacement,  velocity, 
acceleration,  force.  Already  we  can  get  some  idea  of  their  usefulness  using  displacement 
vectors.  Suppose  that  your  grandfather  walked  5  km  NE  and  then  2  km  SSE;  if  the  terrain 
allows,  and  perhaps  armed  with  a  compass,  how  could  your  grandfather  have  walked 
directly  to  his  destination?  We  can  use  vectors  (and  a  bit  of  geometry)  to  answer  this 
question.  We  begin  by  noting  that  since  vectors  do  not  include  a  specification  of  position, 
we  can  "place"  them  anywhere  that  is  convenient.  So  we  can  picture  your  grandfather's 
journey  as  two  displacement  vectors  drawn  head  to  tail: 


The  displacement  vector  for  the  shortcut  route  is  the  vector  drawn  with  a  dashed  line, 
from  the  tail  of  the  first  to  the  head  of  the  second.  With  a  little  trigonometry,  we  can 
compute  that  the  third  vector  has  magnitude  approximately  4.62  and  direction  21.43°, 
so  walking  4.62  km  in  the  direction  21.43°  north  of  east  (approximately  ENE)  would  get 
your  grandfather  to  school.  This  sort  of  calculation  is  so  common,  we  dignify  it  with  a 
name:  we  say  that  the  third  vector  is  the  sum  of  the  other  two  vectors.  There  is  another 
common  way  to  picture  the  sum  of  two  vectors.  Put  the  vectors  tail  to  tail  and  then 
complete  the  parallelogram  they  indicate;  the  sum  of  the  two  vectors  is  the  diagonal  of  the 
parallelogram: 


This  is  a  more  natural  representation  in  some  circumstances.  For  example,  if  the  two 
original  vectors  represent  forces  acting  on  an  object,  the  sum  of  the  two  vectors  is  the 
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net  or  effective  force  on  the  object,  and  it  is  nice  to  draw  all  three  with  their  tails  at  the 
location  of  the  object. 

We  also  define  scalar  multiplication  for  vectors:  if  A  is  a  vector  (m,  9)  and  a  >  0 

is  a  real  number,  the  vector  a  A  is  {am,  6) ,  namely,  it  points  in  the  same  direction  but 
has  a  times  the  magnitude.  If  a  <  0,  a  A  is  (|a|m,^  +  7r),  with  \a\  times  the  magnitude 
and  pointing  in  the  opposite  direction  (unless  we  specify  otherwise,  angles  are  measured 
in  radians). 

Now  we  can  understand  subtraction  of  vectors:  A  —  B  =  A  +  (— 1)B: 


Note  that  as  you  would  expect,  B  +  (A  —  B)  =  A. 

We  can  represent  a  vector  in  ways  other  than  (m,  6),  and  in  fact  (m,  6)  is  not  generally 
used  at  all.  How  else  could  we  describe  a  particular  vector?  Consider  again  the  vector 
(5,45°).  Let's  draw  it  again,  but  impose  a  coordinate  system.  If  we  put  the  tail  of  the 
arrow  at  the  origin,  the  head  of  the  arrow  ends  up  at  the  point  (5/-\/2,  5/-\/2)     (3.54, 3.54). 

(3.54,3.54) 


In  this  picture  the  coordinates  (3.54,3.54)  identify  the  head  of  the  arrow,  provided  we 
know  that  the  tail  of  the  arrow  has  been  placed  at  (0,0).  Then  in  fact  the  vector  can 
always  be  identified  as  (3.54,  3.54),  no  matter  where  it  is  placed;  we  just  have  to  remember 
that  the  numbers  3.54  must  be  interpreted  as  a  change  from  the  position  of  the  tail,  not 
as  the  actual  coordinates  of  the  arrow  head;  to  emphasize  this  we  will  write  (3.54,  3.54) 
to  mean  the  vector  and  (3.54,  3.54)  to  mean  the  point.  Then  if  the  vector  (3.54, 3.54)  is 
drawn  with  its  tail  at  (1,2)  it  looks  like  this: 


(4.54,5.54) 
3.54 


3.54 
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Consider  again  the  two  part  trip:  5  km  NE  and  then  2  km  SSE.  The  vector  representing 
the  first  part  of  the  trip  is  {5/V2,  5/V2),  and  the  second  part  of  the  trip  is  represented  by 
(2cos(— 37r/8),  2sin(— 37r/8))  ^  (0.77,  —1.85).  We  can  represent  the  sum  of  these  with  the 
usual  head  to  tail  picture: 


3.54 

0.77 

/    \  1 

\  1  -1.85 

/  ^ 

\l 

J  (4.3, 1.69) 

It  is  clear  from  the  picture  that  the  coordinates  of  the  destination  point  are  (5/-\/2  + 
2cos(— 37r/8),  5/\/2  +  2sin(— Stt/S))  or  approximately  (4.3,1.69),  so  the  sum  of  the  two 
vectors  is  +  2  cos(-37r/8),  5/^2  +  2  sin(-37r/8))  ^  (4.3,1.69).    Adding  the  two 

vectors  is  easier  in  this  form  than  in  the  (m,  ff)  form,  provided  that  we're  willing  to  have 
the  answer  in  this  form  as  well. 

It  is  easy  to  see  that  scalar  multiplication  and  vector  subtraction  are  also  easy  to 
compute  in  this  form:  a{v,w)  =  {av.aw)  and  {vi.Wi)  —  {v2jW2)  =  (fi  —  V2,wi  —  102). 
What  about  the  magnitude?  The  magnitude  of  the  vector  {v,  w)  is  still  the  length  of  the 
corresponding  arrow  representation;  this  is  the  distance  from  the  origin  to  the  point  (f ,  w), 
namely,  the  distance  from  the  tail  to  the  head  of  the  arrow.  We  know  how  to  compute 
distances,  so  the  magnitude  of  the  vector  is  simply  \/v'^  +  w'^,  which  we  also  denote  with 
absolute  value  bars:  \{v,w)\  —  \/f ^  +  w^. 

In  three  dimensions,  vectors  are  still  quantities  consisting  of  a  magnitude  and  a  direc- 
tion, but  of  course  there  are  many  more  possible  directions.  It's  not  clear  how  we  might 
represent  the  direction  explicitly,  but  the  coordinate  version  of  vectors  makes  just  as  much 
sense  in  three  dimensions  as  in  two.  By  (1,2,3)  we  mean  the  vector  whose  head  is  at 
(1,2,3)  if  its  tail  is  at  the  origin.  As  before,  we  can  place  the  vector  anywhere  we  want; 
if  it  has  its  tail  at  (4,5,6)  then  its  head  is  at  (5,7,9).  It  remains  true  that  arithmetic  is 
easy  to  do  with  vectors  in  this  form: 

a{vi,V2,V3)  =  {avi,av2,av3) 

{Vi,  V2,  V3)  +  {wi,W2,  W3)  =  {Vi  +  Wi,V2  +  W2,V3  +  W3) 
{V1,V2,  V3)  -  {Wi,  W2,  W3)  =  {Vi  -Wi,V2-  W2,  V3  -  W3) 

The  magnitude  of  the  vector  is  again  the  distance  from  the  origin  to  the  head  of  the  arrow, 
or  \{vi,V2,V3)\  =  Jv^  +  vl  +  vj. 
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5. 


4 


V 


X 


Figure  12.2.1     The  vector  (2,4,5)  with  its  tail  at  the  origin. 


Three  particularly  simple  vectors  turn  out  to  be  quite  useful:  i  =  (1,  0,  0),  j  =  (0, 1,  0), 
and  k  =  (0,  0, 1).  These  play  much  the  same  role  for  vectors  that  the  axes  play  for  points. 
In  particular,  notice  that 


We  will  frequently  want  to  produce  a  vector  that  points  from  one  point  to  another. 
That  is,  if  P  and  Q  are  points,  we  seek  the  vector  x  such  that  when  the  tail  of  x  is  placed 
at  P,  its  head  is  at  Q;  we  refer  to  this  vector  as  PQ.  If  we  know  the  coordinates  of  P  and 
(5,  the  coordinates  of  the  vector  are  easy  to  find. 


Exercises  12.2. 

1.  Draw  the  vector  (3,  —1)  with  its  tail  at  the  origin. 

2.  Draw  the  vector  (3,  —1,  2)  with  its  tail  at  the  origin. 

3.  Let  A  be  the  vector  with  tail  at  the  origin  and  head  at  (1,2);  let  B  be  the  vector  with  tail 
at  the  origin  and  head  at  (3, 1).  Draw  A  and  B  and  a  vector  C  with  tail  at  (1,  2)  and  head 
at  (3, 1).  Draw  C  with  its  tail  at  the  origin. 

4.  Let  A  be  the  vector  with  tail  at  the  origin  and  head  at  (—1,  2);  let  B  be  the  vector  with  tail 
at  the  origin  and  head  at  (3,  3).  Draw  A  and  B  and  a  vector  C  with  tail  at  (—1,2)  and  head 
at  (3,  3).  Draw  C  with  its  tail  at  the  origin. 


(^1,  ^2,  ^3)  =  (^1,0,  0)  +  (0,  V2,  0)  +  (0,  0,  v^) 

=  ^;i(l,0,0)+^;2(0,l,0)+^;3(0,0,l) 
=  vii  +  V2}  +  vsk 


EXAMPLE  12.2.1 


Suppose  P  =  (1,-2,4)  and  Q  =  (-2,1,3).   The  vector 


□ 
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5.  Let  A  be  the  vector  with  tail  at  the  origin  and  head  at  (5,2);  let  B  be  the  vector  with  tail 
at  the  origin  and  head  at  (1,  5).  Draw  A  and  B  and  a  vector  C  with  tail  at  (5,  2)  and  head 
at  (1,5).  Draw  C  with  its  tail  at  the  origin. 

6.  Find  |v|,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w|  and  — 2v  for  v  =  (1,  3)  and  w  =  (—1,  —5).  =^ 

7.  Find  |v|,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w|  and  — 2v  for  v  =  (1,  2,  3)  and  w  =  (—1,  2,  —3).  =^> 

8.  Find  |v|,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w|  and  — 2v  for  v  =  (1,  0, 1)  and  w  =  (—1,  —2,  2).  =^> 

9.  Find  |v|,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w|  and  —  2v  for  v  =  (1,  —  1, 1)  and  w  =  (0, 0,  3).  =^> 

10.  Find  |v|,  v  +  w,  v  —  w,  |v  +  w|,  |v  —  w|  and  —  2v  for  v  =  (3,  2, 1)  and  w  =  (  — 1,— 1,— 1).  =^> 

11.  Let  P  =  (4,  5,  6),  Q  =  (1,  2,  —5).  Find  PQ.  Find  a  vector  with  the  same  direction  as  PQ  but 
with  length  1.  Find  a  vector  with  the  same  direction  as  PQ  but  with  length  4. 

12.  If  A,  B,  and  C  are  three  points,  find  AB  +  PC  +  CJA.  =^ 

13.  Consider  the  12  vectors  that  have  their  tails  at  the  center  of  a  clock  and  their  respective 
heads  at  each  of  the  12  digits.  What  is  the  sum  of  these  vectors?  What  if  we  remove  the 
vector  corresponding  to  4  o'clock?  What  if,  instead,  all  vectors  have  their  tails  at  12  o'clock, 
and  their  heads  on  the  remaining  digits?  =^> 

14.  Let  a  and  b  be  nonzero  vectors  in  two  dimensions  that  are  not  parallel  or  anti-parallel. 
Show,  algebraically,  that  if  c  is  any  two  dimensional  vector,  there  are  scalars  s  and  t  such 
that  c  =  sa  +  tb. 

15.  Does  the  statement  in  the  previous  exercise  hold  if  the  vectors  a,  b,  and  c  are  three  dimen- 
sional vectors?  Explain. 


12.3    The  Dot  Product 

Here's  a  question  whose  answer  turns  out  to  be  very  useful:  Given  two  vectors,  what  is 
the  angle  between  them? 

It  may  not  be  immediately  clear  that  the  question  makes  sense,  but  it's  not  hard  to  turn 
it  into  a  question  that  does.  Since  vectors  have  no  position,  we  are  as  usual  free  to  place 
vectors  wherever  we  like.  If  the  two  vectors  are  placed  tail-to-tail,  there  is  now  a  reasonable 
interpretation  of  the  question:  we  seek  the  measure  of  the  smallest  angle  between  the  two 
vectors,  in  the  plane  in  which  they  lie.  Figure  12.3.1  illustrates  the  situation. 


Figure  12.3.1     The  angle  between  vectors  A  and  B. 

Since  the  angle  6  lies  in  a  triangle,  we  can  compute  it  using  a  bit  of  trigonometry, 
namely,  the  law  of  cosines.  The  lengths  of  the  sides  of  the  triangle  in  figure  12.3.1  are  |A|, 
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|A-B|. 

Let  A 

02,03) 

and  B  = 

|A-B|2 

=  |A|2 

+ 

|B|2 

-2|A| 

|B|cos6' 

.||B|cos^ 

=  |Ap 

+ 

-  |A- 

=  al  +  al  +  al  +  bl  +  bl  +  bl-  (ai  -  bif  -  (02  -  62)^  -  (03  -  b^f 
=  al  +  al  +  al  +  bl+bl  +  bl 

-  (a?  -  2ai&i  +      -  {al  -  20262  +  ^2)  -  (03  -  20363  +  63) 
=  2ai6i  +  2a2&2  +  20363 
|A||B|  cos^  =  ai6i  +  02^2  +  0363 

cos^  =  (aibi  +  0262  +  0363)7(1  A||B|) 

So  a  bit  of  simple  arithmetic  with  the  coordinates  of  A  and  B  allows  us  to  compute  the 
cosine  of  the  angle  between  them.  If  necessary  we  can  use  the  arccosine  to  get  6,  but  in 
many  problems  cos  9  turns  out  to  be  all  we  really  need. 

The  numerator  of  the  fraction  that  gives  us  cos  9  turns  up  a  lot,  so  we  give  it  a  name 
and  more  compact  notation:  we  call  it  the  dot  product,  and  write  it  as 

A  •  B  =  ai6i  +  0262  +  0363- 

This  is  the  same  symbol  we  use  for  ordinary  multiplication,  but  there  should  never  be  any 
confusion;  you  can  tell  from  context  whether  we  are  "multiplying"  vectors  or  numbers. 
(We  might  also  use  the  dot  for  scalar  multiplication:  a  •  V  =  aV;  again,  it  is  clear  what  is 
meant  from  context.) 

EXAMPLE  12.3.1  Find  the  angle  between  the  vectors  A  =  (1,  2, 1)  and  B  =  (3, 1,  -5). 
We  know  that  cos^  =  A  ■  B/(|A||B|)  =  (1  •  3  +  2  •  1  +  1  •  (-5))/(|A||B|)  =  0,  so  6'  =  n/2, 
that  is,  the  vectors  are  perpendicular.  □ 

EXAMPLE  12.3.2  Find  the  angle  between  the  vectors  A  =  (3, 3,  0)  and  B  =  (1,  0, 0). 
We  compute 

cos^  =  (3  •  1  +  3  •  0  +  0  •  0)/(V9  +  9  +  0Vl  +  0  +  0) 
=  3/v^= 

so  6'  =  7r/4.  □ 


EXAMPLE  12.3.3  Some  special  cases  are  worth  looking  at:  Find  the  angles  between 
A  and  A;  A  and  —A;  A  and  0  =  (0,  0,  0). 
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cos6'  =  A  •  A/(|A||A|)  =  {af  +  +  a\)/{s^a\  +  a|  +  al^J aj  +  a|  +  a§)  =  1,  so  the 
angle  between  A  and  itself  is  zero,  which  of  course  is  correct. 


cos6'  =  A  -  -A/(|A||-A|)  =  {-al- 4~  al)/{Jal  +  4  +  alJal  +  al+al)  =  -1, 


so  the  angle  is  it,  that  is,  the  vectors  point  in  opposite  directions,  as  of  course  we  already 
knew. 


cos^  =  A-0/(|A||0|)  =  (0  +  0  +  0)/(  Ja?  +     +  ajVo^  +  02  +  02),  which  is  undefined. 


On  the  other  hand,  note  that  since  A  ■  0  =  0  it  looks  at  first  as  if  cos  9  will  be  zero,  which 
as  we  have  seen  means  that  vectors  are  perpendicular;  only  when  we  notice  that  the 
denominator  is  also  zero  do  we  run  into  trouble.  One  way  to  "fix"  this  is  to  adopt  the 
convention  that  the  zero  vector  0  is  perpendicular  to  all  vectors;  then  we  can  say  in  general 


Generalizing  the  examples,  note  the  following  useful  facts: 

1.  If  A  is  parallel  or  anti-parallel  to  B  then  A-B/(|A||B|)  =  ±1,  and  conversely, 
if  A  ■  B/(|A||B|)  =  1,  A  and  B  are  parallel,  while  if  A  •  B/(|A||B|)  =  -1,  A 
and  B  are  anti-parallel.  (Vectors  are  parallel  if  they  point  in  the  same  direction, 
anti-parallel  if  they  point  in  opposite  directions.) 

2.  If  A  is  perpendicular  to  B  then  A  ■  B/(|A||B|)  =  0,  and  conversely  if  A  ■ 
B/(|A||B|)  =  0  then  A  and  B  are  perpendicular. 

Given  two  vectors,  it  is  often  useful  to  find  the  projection  of  one  vector  onto  the 
other,  because  this  turns  out  to  have  important  meaning  in  many  circumstances.  More 
precisely,  given  A  and  B,  we  seek  a  vector  parallel  to  B  but  with  length  determined  by  A 
in  a  natural  way,  as  shown  in  figure  12.3.2.  V  is  chosen  so  that  the  triangle  formed  by  A, 
V,  and  A  —  V  is  a  right  triangle. 


that  if  A  ■  B  =  0,  A  and  B  are  perpendicular. 


□ 


A 


Figure  12.3.2     V  is  the  projection  of  A  onto  B. 


Using  a  little  trigonometry,  we  see  that 
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this  is  sometimes  called  the  scalar  projection  of  A  onto  B.  To  get  V  itself,  we  multiply 
this  length  by  a  vector  of  length  one  parallel  to  B: 


V  = 


A  B  B       A  B 


B. 


|B|    |B|  |B|2 

Be  sure  that  you  understand  why  B/|B|  is  a  vector  of  length  one  (also  called  a  unit 
vector)  parallel  to  B. 

The  discussion  so  far  implicitly  assumed  that  0<^<7r/2.  If7r/2<^<7r,  the  picture 
is  like  figure  12.3.3.  In  this  case  A  ■  B  is  negative,  so  the  vector 


A  B 


B 


is  anti-parallel  to  B,  and  its  length  is 


A  B 


IBI 


So  in  general,  the  scalar  projection  of  A  onto  B  may  be  positive  or  negative.  If  it  is 
negative,  it  means  that  the  projection  vector  is  anti-parallel  to  B  and  that  the  length  of 
the  projection  vector  is  the  absolute  value  of  the  scalar  projection.  Of  course,  you  can  also 
compute  the  length  of  the  projection  vector  as  usual,  by  applying  the  distance  formula  to 
the  vector. 


B 


Figure  12.3.3     V  is  the  projection  of  A  onto  B. 

Note  that  the  phrase  "projection  onto  B"  is  a  bit  misleading  if  taken  literally;  all  that 
B  provides  is  a  direction;  the  length  of  B  has  no  impact  on  the  final  vector.  In  figure  12.3.4, 
for  example,  B  is  shorter  than  the  projection  vector,  but  this  is  perfectly  acceptable. 


Figure  12.3.4     V  is  the  projection  of  A  onto  B. 
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EXAMPLE  12.3.4  Physical  force  is  a  vector  quantity.  It  is  often  necessary  to  compute 
the  "component"  of  a  force  acting  in  a  different  direction  than  the  force  is  being  applied. 
For  example,  suppose  a  ten  pound  weight  is  resting  on  an  inclined  plane — a  pitched  roof, 
for  example.  Gravity  exerts  a  force  of  ten  pounds  on  the  object,  directed  straight  down. 
It  is  useful  to  think  of  the  component  of  this  force  directed  down  and  parallel  to  the  roof, 
and  the  component  down  and  directly  into  the  roof.  These  forces  are  the  projections  of  the 
force  vector  onto  vectors  parallel  and  perpendicular  to  the  roof.  Suppose  the  roof  is  tilted 
at  a  30°  angle,  as  in  figure  12.3.5.  A  vector  parallel  to  the  roof  is  {—Vd,  —1),  and  a  vector 
perpendicular  to  the  roof  is  (1,  —  V^).  The  force  vector  is  F  =  (0,  —10).  The  component 
of  the  force  directed  down  the  roof  is  then 

|{-V3,-1)|2  ^       '     '      2        2  ^        /  .     /  / 

with  length  5.  The  component  of  the  force  directed  into  the  roof  is 

|(l,-v^)|2  ^  '    ^  ^        2  2  \  ^  /  '       /  / 

with  length  5v^.  Thus,  a  force  of  5  pounds  is  pulling  the  object  down  the  roof,  while  a 
force  of  5V3  pounds  is  pulling  the  object  into  the  roof.  □ 


Figure  12.3.5     Components  of  a  force. 


The  dot  product  has  some  familiar-looking  properties  that  will  be  useful  later,  so  we 
list  them  here.  These  may  be  proved  by  writing  the  vectors  in  coordinate  form  and  then 
performing  the  indicated  calculations;  subsequently  it  can  be  easier  to  use  the  properties 
instead  of  calculating  with  coordinates. 

THEOREM  12.3.5    If  u,  v,  and  w  are  vectors  and  a  is  a  real  number,  then 
1.  u  ■  u  =  lup 
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2.  U  V 


=  V  ■  u 


3.  u  •  (v  +  w)  =  u  ■  V  +  u  •  w 

4.  (au)  ■  V  =  a(u  •  v)  =  u  •  (av) 


Exercises  12.3. 

1.  Find  (1,1,1)  •  (2,-3,4).  ^ 

2.  Find  (1,2,0)  •  (0,0,57). 

3.  Find  (3,2,1)  •  (0,1,0). 

4.  Find  (-1, -2,  5)  •  (1,  0, -1).  ^ 

5.  Find  (3,4,6)  •  (2,3,4). 

6.  Find  the  cosine  of  the  angle  between  (1,2,3)  and  (1,1,1);  use  a  calculator  if  necessary  to 
find  the  angle.  =^ 

7.  Find  the  cosine  of  the  angle  between  (  —  1,  —2,  —3)  and  (5,  0,  2);  use  a  calculator  if  necessary 
to  find  the  angle.  =^ 

8.  Find  the  cosine  of  the  angle  between  (47, 100, 0)  and  (0,  0,  5);  use  a  calculator  if  necessary  to 
find  the  angle.  =f- 

9.  Find  the  cosine  of  the  angle  between  (1,0,1)  and  (0,1,1);  use  a  calculator  if  necessary  to 
find  the  angle.  =^ 

10.  Find  the  cosine  of  the  angle  between  (2,  0,  0)  and  (—1, 1,  —1);  use  a  calculator  if  necessary  to 
find  the  angle. 

11.  Find  the  angle  between  the  diagonal  of  a  cube  and  one  of  the  edges  adjacent  to  the  diagonal. 

12.  Find  the  scalar  and  vector  projections  of  (1,2,3)  onto  (1,2,0).  =^ 

13.  Find  the  scalar  and  vector  projections  of  (1, 1, 1)  onto  (3,  2, 1). 

14.  A  force  of  10  pounds  is  applied  to  a  wagon,  directed  at  an  angle  of  30°.  Find  the  component 
of  this  force  pulling  the  wagon  straight  up,  and  the  component  pulling  it  horizontally  along 
the  ground. 


15.  A  force  of  15  pounds  is  applied  to  a  wagon,  directed  at  an  angle  of  45°.  Find  the  component 
of  this  force  pulling  the  wagon  straight  up,  and  the  component  pulling  it  horizontally  along 
the  ground.  =^ 

16.  Use  the  dot  product  to  find  a  non-zero  vector  w  perpendicular  to  both  u  =  (1,2,-3)  and 


F 


Figure  12.3.6     Pulling  a  wagon. 


V  = 


(2,0,1).  ^ 
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17.  Let  X  =  (1, 1,  0)  and  y  =  (2,  4,  2).  Find  a  unit  vector  that  is  perpendicular  to  both  x  and  y. 
=^ 

18.  Do  the  three  points  (1,  2,  0),  (—2, 1, 1),  and  (0,  3,  —1)  form  a  right  triangle? 

19.  Do  the  three  points  (1, 1, 1),  (2,  3,  2),  and  (5,  0,  —1)  form  a  right  triangle?  =^> 

20.  Show  that  |A  •  B|  <  |A||B| 

21.  Let  X  and  y  be  perpendicular  vectors.  Use  Theorem  12.3.5  to  prove  that  |x|^  +  |y|'^  =  jx+yj'^. 
What  is  this  result  better  known  as? 

22.  Prove  that  the  diagonals  of  a  rhombus  intersect  at  right  angles. 

23.  Suppose  that  z  =  |x|y  +  |y|x  where  x,  y,  and  z  are  all  nonzero  vectors.  Prove  that  z  bisects 
the  angle  between  x  and  y. 

24.  Prove  Theorem  12.3.5. 


12.4    The  Cross  Product 

Another  useful  operation:  Given  two  vectors,  find  a  third  vector  perpendicular  to  the 
first  two.  There  are  of  course  an  infinite  number  of  such  vectors  of  different  lengths. 
Nevertheless,  let  us  find  one.  Suppose  A  =  (01,02,03)  and  B  =  (61,625^3)-  We  want  to 
find  a  vector  v  =  {vi,  V2,  fa)  with  v  •  A  =  v  ■  B  =  0,  or 

aivi  +  a2V2  +  asvs  =  0, 
bivi  +  62^2  +  ^3^3  =  0. 

Multiply  the  first  equation  by  63  and  the  second  by  03  and  subtract  to  get 

6301^1  +  6302^2  +  ^303^3  =  0 
0361^1  +  0362^2  +  asbsvs  =  0 
(0163  -  6103)^1  +  (0263  -  b2a3)v2  =  0 

Of  course,  this  equation  in  two  variables  has  many  solutions;  a  particularly  easy  one  to 
see  is  vi  =  0263  —  62^3,  ^2  =  bias  —  0163.  Substituting  back  into  either  of  the  original 
equations  and  solving  for  vs  gives  ^3  =  0162  —  ^102- 

This  particular  answer  to  the  problem  turns  out  to  have  some  nice  properties,  and  it 
is  dignified  with  a  name:  the  cross  product: 

A  X  B  =  (0263  -  ^2^3,  ^1^3  -  0163,  0162  -  bia2). 

While  there  is  a  nice  pattern  to  this  vector,  it  can  be  a  bit  difficult  to  memorize;  here  is  a 
convenient  mnemonic.  The  determinant  of  a  two  by  two  matrix  is 
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This  is  extended  to  the  determinant  of  a  three  by  three  matrix: 


X 

bi 


y  z 

&2  ^3 


X 


02  as 
h2  bs 


bi 


as 
b3 


+  z 


bi 


a2 
b2 


=  x{a2bz  -  b2az)  -  y{aibz  -  6103)  +  z{aib2  -  6102) 
=  ^(0263  -  6203)  +  2/(6103  -  0163)  +  z{aih2  -  bia2). 

Each  of  the  two  by  two  matrices  is  formed  by  deleting  the  top  row  and  one  column  of  the 
three  by  three  matrix;  the  subtraction  of  the  middle  term  must  also  be  memorized.  This 
is  not  the  place  to  extol  the  uses  of  the  determinant;  suffice  it  to  say  that  determinants 
are  extraordinarily  useful  and  important.  Here  we  want  to  use  it  merely  as  a  mnemonic 
device.  You  will  have  noticed  that  the  three  expressions  in  parentheses  on  the  last  line  are 
precisely  the  three  coordinates  of  the  cross  product;  replacing  x,  y,  z  by  i,  j,  k  gives  us 


1 

ai 
bi 


j  k 

a2  as 
62  bs 


(0263  -  6203)1  -  (0163  -  6ia3)j  +  (0162  -  6ia2)k 

=  (0263  -  6203)1  +  (6ia3  -  0163 )j  +  (ai62  -  6ia2)k 
=  (0263  -  6203,  6103  -  0163,  0162  -  6102) 
=  A  X  B. 

Given  A  and  B,  there  are  typically  two  possible  directions  and  an  infinite  number  of 
magnitudes  that  will  give  a  vector  perpendicular  to  both  A  and  B.  As  we  have  picked  a 
particular  one,  we  should  investigate  the  magnitude  and  direction. 

We  know  how  to  compute  the  magnitude  of  A  x  B;  it's  a  bit  messy  but  not  difficult. 
It  is  somewhat  easier  to  work  initially  with  the  square  of  the  magnitude,  so  as  to  avoid  the 
square  root: 

|A  X  B|^  =  (0263  -  6203)^  +  (6103  -  0163)^  +  (ai62  -  6102)^ 


—  ^ibi  ~  2a26362a3  +  62(23  +  6^03  —  261030163  +  0^63  +  0^63  —  201626102  +  6^03 
While  it  is  far  from  obvious,  this  nasty  looking  expression  can  be  simplified: 


A  X  B|^  =  {al  +  al  +  a^)(6?  +  6^  +  6^)  -  (ai6i  +  0262  +  0363)^ 


=  |A 

=  |A 
=  lA 


(A-B)2 

|A|2|B|2cos2  6' 
'(1  -cos2  6') 


lA  X  BI 


|A|2|B|2sin2  6' 
|A||B|sin6' 
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The  magnitude  of  A  x  B  is  thus  very  similar  to  the  dot  product.  In  particular,  notice  that 
if  A  is  parallel  to  B,  the  angle  between  them  is  zero,  so  sin^  =  0,  so  |A  x  B|  =  0,  and 
likewise  if  they  are  anti-parallel,  sin^  =  0,  and  |A  x  B|  =  0.  Conversely,  if  |A  x  B|  =  0 
and  |A|  and  |B|  are  not  zero,  it  must  be  that  sin 9  =  0,  so  A  is  parallel  or  anti-parallel  to 
B. 

Here  is  a  curious  fact  about  this  quantity  that  turns  out  to  be  quite  useful  later  on: 
Given  two  vectors,  we  can  put  them  tail  to  tail  and  form  a  parallelogram,  as  in  figure  12.4.1. 
The  height  of  the  parallelogram,  h,  is  |A|sin^,  and  the  base  is  |B|,  so  the  area  of  the 
parallelogram  is  |A||B|  sin^,  exactly  the  magnitude  of  |A  x  B|. 


/ 


B 


Figure  12.4.1     A  parallelogram. 


What  about  the  direction  of  the  cross  product?  Remarkably,  there  is  a  simple  rule  that 
describes  the  direction.  Let's  look  at  a  simple  example:  Let  A  =  (a,  0,  0),  B  =  (6,  c,  0).  If 
the  vectors  are  placed  with  tails  at  the  origin,  A  lies  along  the  x-axis  and  B  lies  in  the  x-y 
plane,  so  we  know  the  cross  product  will  point  either  up  or  down.  The  cross  product  is 


A  x  B 


i    j  k 

a  0  0 
b    c  0 


(0,0,  ac). 


As  predicted,  this  is  a  vector  pointing  up  or  down,  depending  on  the  sign  of  ac.  Suppose 
that  a  >  0,  so  the  sign  depends  only  on  c:  if  c  >  0,  ac  >  0  and  the  vector  points  up;  if 
c  <  0,  the  vector  points  down.  On  the  other  hand,  if  a  <  0  and  c  >  0,  the  vector  points 
down,  while  if  a  <  0  and  c  <  0,  the  vector  points  up.  Here  is  how  to  interpret  these  facts 
with  a  single  rule:  Imagine  rotating  vector  A  until  it  points  in  the  same  direction  as  B; 
there  are  two  ways  to  do  this — use  the  rotation  that  goes  through  the  smaller  angle.  If 
a  >  0  and  c  >  0,  or  a  <  0  and  c  <  0,  the  rotation  will  be  counter-clockwise  when  viewed 
from  above;  in  the  other  two  cases,  A  must  be  rotated  clockwise  to  reach  B.  The  rule  is: 
counter-clockwise  means  up,  clockwise  means  down.  If  A  and  B  are  any  vectors  in  the  x-y 
plane,  the  same  rule  applies — A  need  not  be  parallel  to  the  x-axis. 

Although  it  is  somewhat  difficult  computationally  to  see  how  this  plays  out  for  any 
two  starting  vectors,  the  rule  is  essentially  the  same.  Place  A  and  B  tail  to  tail.  The  plane 
in  which  A  and  B  lie  may  be  viewed  from  two  sides;  view  it  from  the  side  for  which  A 
must  rotate  counter-clockwise  to  reach  B;  then  the  vector  A  x  B  points  toward  you. 
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This  rule  is  usually  called  the  right  hand  rule.  Imagine  placing  the  heel  of  your  right 
hand  at  the  point  where  the  tails  are  joined,  so  that  your  slightly  curled  fingers  indicate 
the  direction  of  rotation  from  A  to  B.  Then  your  thumb  points  in  the  direction  of  the 
cross  product  A  x  B. 

One  immediate  consequence  of  these  facts  is  that  A  x  B  7^  B  x  A,  because  the  two 
cross  products  point  in  the  opposite  direction.  On  the  other  hand,  since 

|A  X  B|  =  |A||B|sin^  =  |B||A|  sin 6'  =  |B  x  A|, 

the  lengths  of  the  two  cross  products  are  equal,  so  we  know  that  A  x  B  =  —  (B  x  A). 

The  cross  product  has  some  familiar-looking  properties  that  will  be  useful  later,  so  we 
list  them  here.  As  with  the  dot  product,  these  can  be  proved  by  performing  the  appropriate 
calculations  on  coordinates,  after  which  we  may  sometimes  avoid  such  calculations  by  using 
the  properties. 

THEOREM  12.4.1    If  u,  v,  and  w  are  vectors  and  a  is  a  real  number,  then 

1.  ux(v  +  w)  =  uxv  +  uxw 

2.  (v  +  w)xu  =  vxu  +  wxu 

3.  (au)  X  V  =  a(u  x  v)  =  u  x  (av) 

4.  u  •  (v  X  w)  =  (u  X  v)  ■  w 

5.  u  X  (v  X  w)  =  (u  ■  w)v  —  (u  ■  v)w  ■ 

Exercises  12.4- 

1.  Find  the  cross  product  of  (1,1,1)  and  (1,2,3). 

2.  Find  the  cross  product  of  (1,0,2)  and  (-1,-2,4).  =^> 

3.  Find  the  cross  product  of  (—2, 1,3)  and  (5,2,  —1). 

4.  Find  the  cross  product  of  (1,0,0)  and  (0,0, 1).  =^ 

5.  Two  vectors  u  and  v  are  separated  by  an  angle  of  tt/6,  and  |u|  =  2  and  |v|  =  3.  Find  |u  x  v|. 

6.  Two  vectors  u  and  v  are  separated  by  an  angle  of  7r/4,  and  |u|  =  3  and  |v|  =  7.  Find  |u  x  v|. 

=^ 

7.  Find  the  area  of  the  parallelogram  with  vertices  (0,  0),  (1,  2),  (3,  7),  and  (2,  5). 

8.  Find  and  explain  the  value  of  (i  x  j)  x  k  and  (i  +  j)  x  (i  —  j). 

9.  Prove  that  for  all  vectors  u  and  v,  (u  x  v)  •  v  =  0. 

10.  Prove  Theorem  12.4.1. 

11.  Define  the  triple  product  of  three  vectors,  x,  y,  and  z,  to  be  the  scalar  x  •  (y  x  z).  Show 
that  three  vectors  lie  in  the  same  plane  if  and  only  if  their  triple  product  is  zero.  Verify  that 
(1,  5,  —2),  (4,  3,  0)  and  (6, 13,  —4)  are  coplanar. 
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12.5    Lines  and  Planes 

Lines  and  planes  are  perhaps  the  simplest  of  curves  and  surfaces  in  three  dimensional 
space.  They  also  will  prove  important  as  we  seek  to  understand  more  complicated  curves 
and  surfaces. 

The  equation  of  a  line  in  two  dimensions  is  ax  +  by  =  c;  it  is  reasonable  to  expect  that 
a  line  in  three  dimensions  is  given  by  ax  +  by  +  cz  =  d;  reasonable,  but  wrong — it  turns 
out  that  this  is  the  equation  of  a  plane. 

A  plane  does  not  have  an  obvious  "direction"  as  does  a  line.  It  is  possible  to  associate 
a  plane  with  a  direction  in  a  very  useful  way,  however:  there  are  exactly  two  directions 
perpendicular  to  a  plane.  Any  vector  with  one  of  these  two  directions  is  called  normal  to 
the  plane.  So  while  there  are  many  normal  vectors  to  a  given  plane,  they  are  all  parallel 
or  anti-parallel  to  each  other. 

Suppose  two  points  {vi,  V2,  vs)  and  (tfi,  W2,  w^)  are  in  a  plane;  then  the  vector  {wi  — 
vi,W2  —  V2,ws  —  Vs)  is  parallel  to  the  plane;  in  particular,  if  this  vector  is  placed  with 
its  tail  at  {vi,V2,V3)  then  its  head  is  at  {wi,W2,W3)  and  it  lies  in  the  plane.  As  a  result, 
any  vector  perpendicular  to  the  plane  is  perpendicular  to  {wi  —  vi,W2  —  V2,ws  —  vs).  In 
fact,  it  is  easy  to  see  that  the  plane  consists  of  precisely  those  points  {wi,W2,ws)  for 
which  {wi  —  vi,W2  —  f2,  "^3  —  Vs)  is  perpendicular  to  a  normal  to  the  plane,  as  indicated 
in  figure  12.5.1.  Turning  this  around,  suppose  we  know  that  (a,  6,  c)  is  normal  to  a  plane 
containing  the  point  (f i, t'2, 'I's)-  Then  {x,y,z)  is  in  the  plane  if  and  only  if  {a,h,c)  is 
perpendicular  to  {x  —  vi,y  —  V2,  z  —  v^) ■  In  turn,  we  know  that  this  is  true  precisely  when 
{a,b,c)  ■  {x  —  vi,y  —  V2,  z  —  vs)  =  0.  That  is,  (x,  y,  z)  is  in  the  plane  if  and  only  if 

(a,6,c)  ■  {x-vi,y-V2,z-V3)  =0 
a{x  -  vi)  +  b{y  -  V2)  +  c{z  -  V3)  =  0 
ax  +  by  +  cz  —  avi  —  bv2  —  cf  3  =  0 

ax  +  by  +  cz  —  avi  +  bv2  +  CV3. 

Working  backwards,  note  that  if  (x,  y,  z)  is  a  point  satisfying  ax  +  by  +  cz  =  d  then 

ax  +  by  +  cz  =  d 
ax  +  by  +  cz  —  d  =  0 
a{x  -  d/a)  +  b{y  -  0)  +  0(2  -  0)  =  0 
(a,  6,  c)  ■  {x  —  d/a,  y,  z)  =  0. 

Namely,  (a,  6,  c)  is  perpendicular  to  the  vector  with  tail  at  {d/a,  0,  0)  and  head  at  (x,  y,  z). 
This  means  that  the  points  {x,y,z)  that  satisfy  the  equation  ax  +  by  +  cz  =  d  form  a 
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Figure  12.5.1     A  plane  defined  via  vectors  perpendicular  to  a  normal.  (AP) 

plane  perpendicular  to  (a,  6,  c).  (This  doesn't  work  if  a  =  0,  but  in  that  case  we  can 
use  6  or  c  in  the  role  of  a.  That  is,  either  a{x  —  0)  +  b{y  —  d/b)  +  c{z  —  0)  =  0  or 
a{x  -  0)  +  b{y  -  0)  +  c{z  -  d/c)  =  0.) 

Thus,  given  a  vector  (a,  6,  c)  we  know  that  all  planes  perpendicular  to  this  vector  have 
the  form  ax  +  by  +  cz  =  d,  and  any  surface  of  this  form  is  a  plane  perpendicular  to  (a,  b,  c) . 

EXAMPLE  12.5.1  Find  an  equation  for  the  plane  perpendicular  to  (1,2,3)  and  con- 
taining the  point  (5,  0,  7). 

Using  the  derivation  above,  the  plane  is  Ix  +  2y  +  3z  =  1  ■  5  +  2  ■  0  +  3  •  7  =  26. 
Alternately,  we  know  that  the  plane  is  x  +  2?/  +  3z  =  (i,  and  to  find  d  we  may  substitute 
the  known  point  on  the  plane  to  get  5  +  2  ■  0  +  3  ■  7  =  d,  so  (i  =  26.  □ 

EXAMPLE  12.5.2     Find  a  vector  normal  to  the  plane  2x  —  "iy  +  z  =  15. 

One  example  is  (2,  —3, 1).  Any  vector  parallel  or  anti-parallel  to  this  works  as  well,  so 
for  example  —2(2,  —3, 1)  =  (—4,  6,  —2)  is  also  normal  to  the  plane.  □ 

We  will  frequently  need  to  find  an  equation  for  a  plane  given  certain  information  about 
the  plane.  While  there  may  occasionally  be  slightly  shorter  ways  to  get  to  the  desired  result, 
it  is  always  possible,  and  usually  advisable,  to  use  the  given  information  to  find  a  normal 
to  the  plane  and  a  point  on  the  plane,  and  then  to  find  the  equation  as  above. 
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EXAMPLE  12.5.3  The  planes  x  —  z  =  1  and  y  +  2z  =  3  intersect  in  a  line.  Find  a 
third  plane  that  contains  this  line  and  is  perpendicular  to  the  plane  x  +  y  —  2z  =  1. 

First,  we  note  that  two  planes  are  perpendicular  if  and  only  if  their  normal  vectors 
are  perpendicular.  Thus,  wc  seek  a  vector  {a,b,c)  that  is  perpendicular  to  (1,1,-2).  In 
addition,  since  the  desired  plane  is  to  contain  a  certain  line,  (a,  b,  c)  must  be  perpendicular 
to  any  vector  parallel  to  this  line.  Since  (a,  6,  c)  must  be  perpendicular  to  two  vectors,  we 
may  find  it  by  computing  the  cross  product  of  the  two.  So  we  need  a  vector  parallel  to  the 
line  of  intersection  of  the  given  planes.  For  this,  it  suffices  to  know  two  points  on  the  line. 
To  find  two  points  on  this  line,  we  must  find  two  points  that  are  simultaneously  on  the 
two  planes,  x  —  z  —  1  and  y  +  2z  —  3.  Any  point  on  both  planes  will  satisfy  x  —  z  —  1  and 
y  +  2z  =  S.  It  is  easy  to  find  values  for  x  and  z  satisfying  the  first,  such  as  a;  =  1,  2;  =  0 
and  X  =  2,z  =  1.  Then  we  can  find  corresponding  values  for  y  using  the  second  equation, 
namely  y  =  3  and  j/  =  1,  so  (1, 3,  0)  and  (2, 1, 1)  are  both  on  the  line  of  intersection  because 
both  are  on  both  planes.  Now  (2  —  1,1  —  3,1  —  0)  =  (1,  —2, 1)  is  parallel  to  the  line.  Finally, 
we  may  choose  (a,  6,  c)  =  (1,1,  —2)  x  (1,  —2, 1)  =  (—3,  —3,  —3).  While  this  vector  will  do 
perfectly  well,  any  vector  parallel  or  anti-parallel  to  it  will  work  as  well,  so  for  example  we 
might  choose  (1,1,1)  which  is  anti-parallel  to  it. 

Now  we  know  that  (1, 1, 1)  is  normal  to  the  desired  plane  and  (2, 1, 1)  is  a  point  on 
the  plane.  Therefore  an  equation  of  the  plane  is  x  +  y  +  z  =  4.  As  a  quick  check,  since 
(I,  3,  0)  is  also  on  the  line,  it  should  be  on  the  plane;  since  1  -|-  3  -|-  0  =  4,  we  see  that  this 
is  indeed  the  case. 

Note  that  had  we  used  (—3,  —3,  —3)  as  the  normal,  we  would  have  discovered  the 
equation  —3x  —  3y  —  3z  =  —12,  then  we  might  well  have  noticed  that  we  could  divide  both 
sides  by  —3  to  get  the  equivalent  x  +  y  +  z  =  4.  □ 

So  we  now  understand  equations  of  planes;  let  us  turn  to  lines.  Unfortunately,  it  turns 
out  to  be  quite  inconvenient  to  represent  a  typical  line  with  a  single  equation;  we  need  to 
approach  lines  in  a  different  way. 

Unlike  a  plane,  a  line  in  three  dimensions  does  have  an  obvious  direction,  namely,  the 
direction  of  any  vector  parallel  to  it.  In  fact  a  line  can  be  defined  and  uniquely  identified 
by  providing  one  point  on  the  line  and  a  vector  parallel  to  the  line  (in  one  of  two  possible 
directions).  That  is,  the  line  consists  of  exactly  those  points  we  can  reach  by  starting  at 
the  point  and  going  for  some  distance  in  the  direction  of  the  vector.  Let's  see  how  we  can 
translate  this  into  more  mathematical  language. 

Suppose  a  line  contains  the  point  (^1,^2, 1^3)  and  is  parallel  to  the  vector  {a,b,c).  If 
we  place  the  vector  (^1,^2, 1^3)  with  its  tail  at  the  origin  and  its  head  at  (vi, ^2, 1^3),  and 
if  we  place  the  vector  {a,b,c)  with  its  tail  at  (vi,  ^2, 1^3)5  then  the  head  of  {a,b,c)  is  at  a 
point  on  the  line.  We  can  get  to  any  point  on  the  line  by  doing  the  same  thing,  except 
using  t{a,  b,  c)  in  place  of  (a,  b,  c),  where  t  is  some  real  number.  Because  of  the  way  vector 
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Figure  12.5.2     Vector  form  of  a  line. 

addition  works,  the  point  at  the  head  of  the  vector  i(a,  6,  c)  is  the  point  at  the  head  of  the 
vector  (f  1,  f  2,  V3)  +  t{a,  6,  c),  namely  {vi  +  ta,  f  2  +  tb,  vs  +  tc);  see  figure  12.5.2. 

In  other  words,  as  t  runs  through  aU  possible  real  values,  the  vector  {vi,V2,V3)+t{a,  b,  c) 
points  to  every  point  on  the  line  when  its  tail  is  placed  at  the  origin.  Another  common 
way  to  write  this  is  as  a  set  of  parametric  equations: 

X  =  vi  +  ta       y  =  V2  +  tb       z  =  V3  +  tc. 

It  is  occasionally  useful  to  use  this  form  of  a  line  even  in  two  dimensions;  a  vector  form  for 
a  line  in  the  x-y  plane  is  {vi,  V2)  +  t{a,  6),  which  is  the  same  as  {vi,  V2,  0)  +  t(a,  6,  0). 

EXAMPLE  12.5.4    Find  a  vector  expression  for  the  line  through  (6, 1,  —3)  and  (2, 4,  5). 

To  get  a  vector  parallel  to  the  line  we  subtract  (6, 1,  —3)  —  (2, 4,  5)  =  (4,  —3,  —8).  The  line 
is  then  given  by  (2,4,  5)  +  t(4,  —3,  —8);  there  are  of  course  many  other  possibilities,  such 
as  (6, 1,-3) +t(4, -3, -8).  □ 

EXAMPLE  12.5.5  Determine  whether  the  lines  (1,1,1)  +t(l,2,-l)  and  (3,2,1)  + 
t(— 1,  —5,3)  are  parallel,  intersect,  or  neither. 

In  two  dimensions,  two  lines  either  intersect  or  are  parallel;  in  three  dimensions,  lines 
that  do  not  intersect  might  not  be  parallel.  In  this  case,  since  the  direction  vectors  for  the 
lines  are  not  parallel  or  anti-parallel  we  know  the  lines  are  not  parallel.  If  they  intersect, 
there  must  be  two  values  a  and  b  so  that  (1, 1, 1)  +  a(l,  2,  —1)  =  (3,  2, 1)  +  6(— 1,  —5,  3), 
that  is, 

1  +  a  =  3  -  6 
1  +  2a  =  2  -  56 
1  -  a  =  1  +  36 

This  gives  three  equations  in  two  unknowns,  so  there  may  or  may  not  be  a  solution  in 
general.  In  this  case,  it  is  easy  to  discover  that  a  =  3  and  b  =  —1  satisfies  all  three 
equations,  so  the  lines  do  intersect  at  the  point  (4,  7,  —2).  □ 


EXAMPLE  12.5.6  Find  the  distance  from  the  point  (1,  2,  3)  to  the  plane  2x-y+3z  =  5. 
The  distance  from  a  point  P  to  a  plane  is  the  shortest  distance  from  P  to  any  point  on  the 


12.5    Lines  and  Planes  321 


plane;  this  is  the  distance  measured  from  P  perpendicular  to  the  plane;  see  figure  12.5.3. 

This  distance  is  the  absolute  value  of  the  scalar  projection  of  QP  onto  a  normal  vector  n, 
where  Q  is  any  point  on  the  plane.  It  is  easy  to  find  a  point  on  the  plane,  say  (1,0, 1). 
Thus  the  distance  is 

(0,2,  2) -(2, -1,3)  ^  ^ 
1(2,-1,3)1 

□ 


Figure  12.5.3     Distance  from  a  point  to  a  plane. 


EXAMPLE  12.5.7  Find  the  distance  from  the  point  (-1,2,1)  to  the  line  (1,1,1)  + 
t(2,3,  —1).  Again  we  want  the  distance  measured  perpendicular  to  the  line,  as  indicated 
in  figure  12.5.4.  The  desired  distance  is 

lapxAi 

|QP|sin6'=  ' 


|A|  ' 

where  A  is  any  vector  parallel  to  the  line.  From  the  equation  of  the  line,  we  can  use 
Q  =  (1, 1, 1)  and  A  =  (2,  3,  —1),  so  the  distance  is 

|(-2,1,0)  X  (2,3,-1)1  |(-l,-2,-8)| 


14  V14  V14 

□ 


Figure  12.5.4     Distance  from  a  point  to  a  line. 
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Exercises  12.5. 

1.  Find  an  equation  of  the  plane  containing  (6,  2, 1)  and  perpendicular  to  (1, 1,1). 

2.  Find  an  equation  of  the  plane  containing  (—1,  2,  —3)  and  perpendicular  to  (4,  5,-1).  =r- 

3.  Find  an  equation  of  the  plane  containing  (1,  2,  —3),  (0, 1,  —2)  and  (1,  2,  —2).  =^ 

4.  Find  an  equation  of  the  plane  containing  (1,  0,  0),  (4,  2,  0)  and  (3,  2, 1). 

5.  Find  an  equation  of  the  plane  containing  (1,  0,  0)  and  the  line  (1,  0,  2)  +  t(3,  2, 1).  =^ 

6.  Find  an  equation  of  the  plane  containing  the  line  of  intersection  ofx  +  y  +  z  =  1  and 
X  —  y  +  2z  =  2,  and  perpendicular  to  the  x-y  plane.  =^ 

7.  Find  an  equation  of  the  line  through  (1,  0,  3)  and  (1,  2, 4).  =^ 

8.  Find  an  equation  of  the  line  through  (1,  0,  3)  and  perpendicular  to  the  plane  x  +  2y  —  z  =  1. 

=^ 

9.  Find  an  equation  of  the  line  through  the  origin  and  perpendicular  to  the  plane  x  +  y  —  z  =  2. 
=^ 

10.  Find  a  and  c  so  that  (a,  1,  c)  is  on  the  line  through  (0,  2,  3)  and  (2,  7,  5).  =^> 

11.  Explain  how  to  discover  the  solution  in  example  12.5.5. 

12.  Determine  whether  the  lines  (1,  3,  — 1) +t(l,  1,  0)  and  (0,  0,  0) +t(l,  4,  5)  are  parallel,  intersect, 
or  neither.  =^> 

13.  Determine  whether  the  lines  (1,0,2)  +t(-l,-l,2)  and  (4,4,2)  +t(2,2, -4)  are  parallel, 
intersect,  or  neither.  =^ 

14.  Determine  whether  the  lines  (1,2,-1)  +t(l,2,3)  and  (1,0,1)  +  i(2/3,  2,  4/3)  are  parallel, 
intersect,  or  neither.  =^ 

15.  Determine  whether  the  lines  (1,1,2)  +t(l,2,— 3)  and  (2,3,-1)  +t(2,4,  — 6)  are  parallel, 
intersect,  or  neither. 

16.  Find  a  unit  normal  vector  to  each  of  the  coordinate  planes. 

17.  Show  that  (2, 1,  3)  +  t{l,  1,  2)  and  (3,  2,  5)  +  s{2,  2,  4)  are  the  same  line. 

18.  Give  a  prose  description  for  each  of  the  following  processes: 

a.  Given  two  distinct  points,  find  the  line  that  goes  through  them. 

b.  Given  three  points  (not  all  on  the  same  line),  find  the  plane  that  goes  through  them. 
Why  do  we  need  the  caveat  that  not  all  points  be  on  the  same  line? 

c.  Given  a  line  and  a  point  not  on  the  line,  find  the  plane  that  contains  them  both. 

d.  Given  a  plane  and  a  point  not  on  the  plane,  find  the  line  that  is  perpendicular  to  the 
plane  through  the  given  point. 

19.  Find  the  distance  from  (2,  2,  2)  to  x  +  y  +  z  =  —1.  =^ 

20.  Find  the  distance  from  (2,  —1,  —1)  to  2x  —  3y  +  z  =  2.  =^ 

21.  Find  the  distance  from  (2,  -1, 1)  to  (2,  2,  0)  +  t{l,  2,3).  =^ 

22.  Find  the  distance  from  (1,0, 1)  to  (3,  2, 1)  +  i(2, -1, -2).  =^ 

23.  Find  the  cosine  of  the  angle  between  the  planes  x  +  y  -\-  z  =  2  and  x  +  2y  -\-  3z  =  8. 

24.  Find  the  cosine  of  the  angle  between  the  planes  x  —  y  -\-  2z  =  2  and  3x  —  2y  -\-  z  =  5.  =^> 
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12.6    Other  Coordinate  Systems 

Coordinate  systems  are  tools  that  let  us  use  algebraic  methods  to  understand  geometry. 
While  the  rectangular  (also  called  Cartesian)  coordinates  that  we  have  been  discussing 
are  the  most  common,  some  problems  are  easier  to  analyze  in  alternate  coordinate  systems. 

A  coordinate  system  is  a  scheme  that  allows  us  to  identify  any  point  in  the  plane  or 
in  three-dimensional  space  by  a  set  of  numbers.  In  rectangular  coordinates  these  numbers 
are  interpreted,  roughly  speaking,  as  the  lengths  of  the  sides  of  a  rectangular  "box." 

In  two  dimensions  you  may  already  be  familiar  with  an  alternative,  called  polar  co- 
ordinates. In  this  system,  each  point  in  the  plane  is  identified  by  a  pair  of  numbers  (r,  9) . 
The  number  9  measures  the  angle  between  the  positive  x-axis  and  a  vector  with  tail  at 
the  origin  and  head  at  the  point,  as  shown  in  figure  12.6.1;  the  number  r  measures  the 
distance  from  the  origin  to  the  point.  Either  of  these  may  be  negative;  a  negative  9  indi- 
cates the  angle  is  measured  clockwise  from  the  positive  x-axis  instead  of  counter-clockwise, 
and  a  negative  r  indicates  the  point  at  distance  |r|  in  the  opposite  of  the  direction  given 
by  9.  Figure  12.6.1  also  shows  the  point  with  rectangular  coordinates  (1,  and  polar 
coordinates  (2,7r/3),  2  units  from  the  origin  and  tc/3  radians  from  the  positive  x-axis. 


(2,7r/3) 


r  cos^ 


Figure  12.6.1     Polar  coordinates:  the  general  case  and  the  point  with  rectangular  coordi- 
nates (1,  Vs). 


We  can  extend  polar  coordinates  to  three  dimensions  simply  by  adding  a  z  coordi- 
nate; this  is  called  cylindrical  coordinates.  Each  point  in  three-dimensional  space  is 
represented  by  three  coordinates  (r,  6*,  z)  in  the  obvious  way:  this  point  is  z  units  above 
or  below  the  point  {r,9)  in  the  x-y  plane,  as  shown  in  figure  12.6.2.  The  point  with  rect- 
angular coordinates  (1,a/3,  3)  and  cylindrical  coordinates  (2, 7r/3,3)  is  also  indicated  in 
figure  12.6.2. 

Some  figures  with  relatively  complicated  equations  in  rectangular  coordinates  will  be 
represented  by  simpler  equations  in  cylindrical  coordinates.  For  example,  the  cylinder  in 
figure  12.6.3  has  equation  -\-  =  4  in  rectangular  coordinates,  but  equation  r  =  2  in 
cylindrical  coordinates. 
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'(2,7r/3,3) 


Figure  12.6.2     Cylindrical  coordinates:   the  general  case  and  the  point  with  rectangular 
coordinates  (1,  \/3,  3). 


2  2 


Figure  12.6.3     The  cylinder  r  =  2. 

Given  a  point  (r,  6)  in  polar  coordinates,  it  is  easy  to  see  (as  in  figure  12.6.1)  that  the 
rectangular  coordinates  of  the  same  point  are  (r  cos^,  r  sin^),  and  so  the  point  (r,9,z)  in 
cylindrical  coordinates  is  (r  cos  ^,  r  sin^,  2;)  in  rectangular  coordinates.  This  means  it  is 
usually  easy  to  convert  any  equation  from  rectangular  to  cylindrical  coordinates:  simply 
substitute 

X  =  r  cos  6 


y  =  rsin^ 
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and  leave  z  alone.  For  example,  starting  with  +  =  4  and  substituting  x  =  r  cos  ^, 
y  =  r  sin  9  gives 

cos^  9  +  r'^  sin^  9  =  4 
r^(cos^  9  +  sin^  9)  =  4 
=  4 
r  =  2. 

Of  course,  it's  easy  to  see  directly  that  this  defines  a  cylinder  as  mentioned  above. 

Cylindrical  coordinates  are  an  obvious  extension  of  polar  coordinates  to  three  dimen- 
sions, but  the  use  of  the  z  coordinate  means  they  are  not  as  closely  analogous  to  polar 
coordinates  as  another  standard  coordinate  system.  In  polar  coordinates,  we  identify  a 
point  by  a  direction  and  distance  from  the  origin;  in  three  dimensions  we  can  do  the  same 
thing,  in  a  variety  of  ways.  The  question  is:  how  do  we  represent  a  direction?  One  way  is 
to  give  the  angle  of  rotation,  6*,  from  the  positive  x  axis,  just  as  in  cylindrical  coordinates, 
and  also  an  angle  of  rotation,  cp,  from  the  positive  z  axis.  Roughly  speaking,  9  is  like  lon- 
gitude and  (j)  is  like  latitude.  (Earth  longitude  is  measured  as  a  positive  or  negative  angle 
from  the  prime  meridian,  and  is  always  between  0  and  180  degrees,  east  or  west;  9  can  be 
any  positive  or  negative  angle,  and  we  use  radians  except  in  informal  circumstances.  Earth 
latitude  is  measured  north  or  south  from  the  equator;  (p  is  measured  from  the  north  pole 
down.)  This  system  is  called  spherical  coordinates;  the  coordinates  are  listed  in  the 
order  (p,  9,  (p),  where  p  is  the  distance  from  the  origin,  and  like  r  in  cylindrical  coordinates 
it  may  be  negative.  The  general  case  and  an  example  are  pictured  in  figure  12.6.4;  the 
length  marked  r  is  the  r  of  cylindrical  coordinates. 


z  z 


X  X 

Figure  12.6.4     Spherical  coordinates:  the  general  case  and  the  point  with  rectangular  coor- 
dinates (1,  v^,  3). 

As  with  cylindrical  coordinates,  we  can  easily  convert  equations  in  rectangular  coordi- 
nates to  the  equivalent  in  spherical  coordinates,  though  it  is  a  bit  more  difficult  to  discover 
the  proper  substitutions.  Figure  12.6.5  shows  the  typical  point  in  spherical  coordinates 
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from  figure  12.6.4,  viewed  now  so  that  the  arrow  marked  r  in  the  original  graph  appears  as 
the  horizontal  "axis"  in  the  left  hand  graph.  From  this  diagram  it  is  easy  to  see  that  the  z 
coordinate  is  pcoscj),  and  that  r  =  psincf),  as  shown.  Thus,  in  converting  from  rectangular 
to  spherical  coordinates  we  will  replace  zhj  p  cos  (f).  To  see  the  substitutions  for  x  and  y  we 
now  view  the  same  point  from  above,  as  shown  in  the  right  hand  graph.  The  hypotenuse  of 
the  triangle  in  the  right  hand  graph  is  r  =  psin^,  so  the  sides  of  the  triangle,  as  shown,  are 
X  =  rcos9  =  psin(/)cos^  and  y  =  rsin^  =  p  sin  (p  sin  9.  So  the  upshot  is  that  to  convert 
from  rectangular  to  spherical  coordinates,  we  make  these  substitutions: 

X  =  p  sin  (j)  cos  6 
y  =  p  sin  (j)  sin  9 
z  =  pcoscf). 


p  sin  (j)  cos  6 


Figure  12.6.5     Converting  from  rectangular  to  spherical  coordinates. 

EXAMPLE  12.6.1  As  the  cylinder  had  a  simple  equation  in  cylindrical  coordinates, 
so  does  the  sphere  in  spherical  coordinates:  p  =  2  is  the  sphere  of  radius  2.  If  we  start 
with  the  Cartesian  equation  of  the  sphere  and  substitute,  we  get  the  spherical  equation: 


2    1      2   1  2 

X  +  y  +  z 

=  2^ 

p2  sin^  (p  cos^  9  +  p"^  sin^  (p  sin^  9  +  p^  cos^  (p 

=  22 

p2  sin^  (^(cos^  9  +  sin^  9)  +  p^  cos^  cp 

=  22 

p2  sin^  (p  +  p'^  cos^  (p 

=  22 

p2(sin2  (p  +  cos^  (p) 

=  22 

=  22 

P 

=  2 

□ 
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EXAMPLE  12.6.2  Find  an  equation  for  the  cylinder  +  =  4  in  spherical  coordi- 
nates. 

Proceeding  as  in  the  previous  example: 

sin^  (f)  cos^  9  -\-  p^  s\v?  (p  sin^  6  =  4 

sin^  ^(cos^  0  +  sin^  9)  =  A 
sin'^  =  4 
p  sin  (j)  =  2 

2 

P 


□ 

Exercises  12.6. 

1.  Convert  the  following  points  in  rectangular  coordinates  to  cylindrical  and  spherical  coordi- 
nates: 

a.  (1,1,1) 

b.  (7,-7,5) 

c.  (cos(l),  sin(l),  1) 

d.  (0,0, -tt)  ^ 

2.  Find  an  equation  for  the  sphere  x'^  +      -|-  z'^  =  4  in  cylindrical  coordinates.  =J> 

3.  Find  an  equation  for  the  y-z  plane  in  cylindrical  coordinates.  =^> 

2         2  2 

4.  Find  an  equation  equivalent  to  x  -\-  y  -\-2z  -|-  2z  —  5  =  0  in  cylindrical  coordinates. 

_  2 

5.  Suppose  the  curve  z  =  e       in  the  x-z  plane  is  rotated  around  the  z  axis.  Find  an  equation 
for  the  resulting  surface  in  cylindrical  coordinates.  =f- 

6.  Suppose  the  curve  z  =  x  \n  the  x-z  plane  is  rotated  around  the  z  axis.  Find  an  equation  for 
the  resulting  surface  in  cylindrical  coordinates.  =^> 

7.  Find  an  equation  for  the  plane  y  =  0  in  spherical  coordinates.  =J> 

8.  Find  an  equation  for  the  plane  z  =  1  \n  spherical  coordinates.  =^ 

9.  Find  an  equation  for  the  sphere  with  radius  1  and  center  at  (0, 1,  0)  in  spherical  coordinates. 

=^ 

10.  Find  an  equation  for  the  cylinder      -|-  y'^  =  4  in  spherical  coordinates. 

11.  Suppose  the  curve  z  =  x  in  the  x-z  plane  is  rotated  around  the  z  axis.  Find  an  equation  for 
the  resulting  surface  in  spherical  coordinates.  =^> 

12.  Plot  the  polar  equations  r  =  sin(0)  and  r  =  cos{6)  and  comment  on  their  similarities.  (If 
you  get  stuck  on  how  to  plot  these,  you  can  multiply  both  sides  of  each  equation  by  r  and 
convert  back  to  rectangular  coordinates). 

13.  Extend  exercises  6  and  11  by  rotating  the  curve  z  =  mx  around  the  z  axis  and  converting  to 
both  cylindrical  and  spherical  coordinates. 
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14.  Convert  the  spherical  formula  p  =  sin 0 sine/)  to  rectangular  coordinates  and  describe  the 
surface  defined  by  the  formula  (Hint:  Multiply  both  sides  by  p.)  =^> 

15.  We  can  describe  points  in  the  first  octant  by  a;  >  0,  y  >  0  and  z  >  0.  Give  similar  inequalities 
for  the  first  octant  in  cylindrical  and  spherical  coordinates. 


13 

Vector  Functions 


13.1    Space  Curves 

We  have  already  seen  that  a  convenient  way  to  describe  a  Une  in  three  dimensions  is  to 
provide  a  vector  that  "points  to"  every  point  on  the  hne  as  a  parameter  t  varies,  hke 

(1,  2,  3)  +  t{l,  -2,  2)  =  (1  +  t,  2  -  2t,  3  +  2t). 

Except  that  this  gives  a  particularly  simple  geometric  object,  there  is  nothing  special  about 
the  individual  functions  of  t  that  make  up  the  coordinates  of  this  vector — any  vector  with 
a  parameter,  like  {f{t),g{t),  h(t)),  will  describe  some  curve  in  three  dimensions  as  t  varies 
through  all  possible  values. 

EXAMPLE  13.1.1   Describe  the  curves  (cost,  sint,  0),  (cost,  sint,  t),  and  (cost,  sint,  2t). 

As  t  varies,  the  first  two  coordinates  in  all  three  functions  trace  out  the  points  on  the 
unit  circle,  starting  with  (1,0)  when  t  =  0  and  proceeding  counter-clockwise  around  the 
circle  as  t  increases.  In  the  first  case,  the  z  coordinate  is  always  0,  so  this  describes  precisely 
the  unit  circle  in  the  x-y  plane.  In  the  second  case,  the  x  and  y  coordinates  still  describe 
a  circle,  but  now  the  z  coordinate  varies,  so  that  the  height  of  the  curve  matches  the  value 
of  t.  When  t  =  n,  for  example,  the  resulting  vector  is  (— l,0,7r).  A  bit  of  thought  should 
convince  you  that  the  result  is  a  helix.  In  the  third  vector,  the  z  coordinate  varies  twice  as 
fast  as  the  parameter  t,  so  we  get  a  stretched  out  helix.  Both  are  shown  in  figure  13.1.1. 
On  the  left  is  the  first  helix,  shown  for  t  between  0  and  47r;  on  the  right  is  the  second  helix, 
shown  for  t  between  0  and  2tt.  Both  start  and  end  at  the  same  point,  but  the  first  helix 
takes  two  full  "turns"  to  get  there,  because  its  z  coordinate  grows  more  slowly.  □ 
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Figure  13.1.1     Two  helixes. 


A  vector  expression  of  the  form  {f{t),g{t),h{t))  is  called  a  vector  function;  it  is 
a  function  from  the  real  numbers  M  to  the  set  of  all  three-dimensional  vectors.  We  can 
alternately  think  of  it  as  three  separate  functions,  x  =  f{t),  y  =  g{t),  and  z  =  h{t)^  that 
describe  points  in  space.  In  this  case  we  usually  refer  to  the  set  of  equations  as  parametric 
equations  for  the  curve,  just  as  for  a  line.  While  the  parameter  t  in  a  vector  function 
might  represent  any  one  of  a  number  of  physical  quantities,  or  be  simply  a  "pure  number" , 
it  is  often  convenient  and  useful  to  think  of  t  as  representing  time.  The  vector  function 
then  tells  you  where  in  space  a  particular  object  is  at  any  time. 

Vector  functions  can  be  difficult  to  understand,  that  is,  difficult  to  picture.  When 
available,  computer  software  can  be  very  helpful.  When  working  by  hand,  one  useful 
approach  is  to  consider  the  "projections"  of  the  curve  onto  the  three  standard  coordinate 
planes.  We  have  already  done  this  in  part:  in  example  13.1.1  we  noted  that  all  three  curves 
project  to  a  circle  in  the  x-y  plane,  since  (cost,  sint)  is  a  two  dimensional  vector  function 
for  the  unit  circle. 

EXAMPLE  13.1.2  Graph  the  projections  of  (cost,  sint,  2t)  onto  the  x-z  plane  and  the 
y-z  plane.  The  two  dimensional  vector  function  for  the  projection  onto  the  x-z  plane  is 
(cost,2t),  or  in  parametric  form,  x  =  cost,  z  =  2t.  By  eliminating  t  we  get  the  equation 
X  =  cos{z/2),  the  familiar  curve  shown  on  the  left  in  figure  13.1.2.  For  the  projection  onto 
the  y-z  plane,  we  start  with  the  vector  function  (sint,2t),  which  is  the  same  as  y  =  sint, 
z  =  2t.  Eliminating  t  gives  y  =  sin(2;/2),  as  shown  on  the  right  in  figure  13.1.2.  □ 

Exercises  13.1. 

1.  Investigate  the  curve  r  =  (sin t,  cost,  cos 8t). 

2.  Investigate  the  curve  r  =  (tcost,tsint,  t). 
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Figure  13.1.2     The  projections  of  {cost,  sin  t,2t)  onto  the  x-z  and  y-z  planes. 

3.  Investigate  the  curve  r  =  {t,t'^  ,cost). 

4.  Investigate  the  curve  r 

5.  Find  a  vector  function  for  the  curve  of  intersection  of      +      =  9  and  y  +  z  =  2.  =^> 

6.  A  bug  is  crawling  outward  along  the  spoke  of  a  wheel  that  lies  along  a  radius  of  the  wheel. 
The  bug  is  crawling  at  1  unit  per  second  and  the  wheel  is  rotating  at  1  radian  per  second. 
Suppose  the  wheel  lies  in  the  y-z  plane  with  center  at  the  origin,  and  at  time  t  =  0  the  spoke 
lies  along  the  positive  y  axis  and  the  bug  is  at  the  origin.  Find  a  vector  function  r(t)  for  the 
position  of  the  bug  at  time  t. 

7.  What  is  the  difference  between  the  parametric  curves  f{t)  =  {t,t,t^),  g(t)  =  {t^,t^,t'^),  and 
h{t)  =  (sin(t),  sin(t),  sin'^(t))as  t  runs  over  all  real  numbers? 

8.  Plot  each  of  the  curves  below  in  2  dimensions,  projected  onto  each  of  the  three  standard 
planes  (the  x-y,  x-z,  and  y-z  planes). 

a.  f{t)  =  {t,t^ ,t^),  t  ranges  over  all  real  numbers 

b.  /(t)  =  (t^t  -  l,i2  +  5)  for  0  <  i  <  3 

9.  Given  points  A  =  (01,02,03)  and  B  =  (61,62,63),  give  parametric  equations  for  the  line 
segment  connecting  A  and  B.  Be  sure  to  give  appropriate  t  values. 

10.  With  a  parametric  plot  and  a  set  of  t  values,  we  can  associate  a  'direction'.  For  example, 
the  curve  (cost,  sin is  the  unit  circle  traced  counterclockwise.  How  can  we  amend  a  set  of 
given  parametric  equations  and  t  values  to  get  the  same  curve,  only  traced  backwards? 

13.2    Calculus  with  vector  functions 

A  vector  function  r{t)  =  {f{t),g(t),h{t))  is  a  function  of  one  variable — that  is,  there  is 
only  one  "input"  value.  What  makes  vector  functions  more  complicated  than  the  functions 
y  =  f{x)  that  we  studied  in  the  first  part  of  this  book  is  of  course  that  the  "output"  values 
are  now  three-dimensional  vectors  instead  of  simply  numbers.  It  is  natural  to  wonder  if 
there  is  a  corresponding  notion  of  derivative  for  vector  functions.  In  the  simpler  case  of 
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a  function  y  =  s{t),  in  which  t  represents  time  and  s{t)  is  position  on  a  hne,  we  have 
seen  that  the  derivative  s'{t)  represents  velocity;  we  might  hope  that  in  a  similar  way  the 
derivative  of  a  vector  function  would  tell  us  something  about  the  velocity  of  an  object 
moving  in  three  dimensions. 

One  way  to  approach  the  question  of  the  derivative  for  vector  functions  is  to  write 
down  an  expression  that  is  analogous  to  the  derivative  we  already  understand,  and  see  if 
we  can  make  sense  of  it.  This  gives  us 


if  we  say  that  what  we  mean  by  the  limit  of  a  vector  is  the  vector  of  the  individual 
coordinate  limits.  So  starting  with  a  familiar  expression  for  what  appears  to  be  a  derivative, 
we  find  that  we  can  make  good  computational  sense  out  of  it — but  what  does  it  actually 
mean? 

We  know  how  to  interpret  r{t  +  At)  and  r{t) — they  are  vectors  that  point  to  locations 
in  space;  if  t  is  time,  we  can  think  of  these  points  as  positions  of  a  moving  object  at  times 
that  are  At  apart.  We  also  know  what  Ar  =  r{t  +  At)  —  r{t)  means — it  is  a  vector  that 
points  from  the  head  of  r{t)  to  the  head  of  r{t  +  At),  assuming  both  have  their  tails  at 
the  origin.  So  when  At  is  small,  Ar  is  a  tiny  vector  pointing  from  one  point  on  the  path 
of  the  object  to  a  nearby  point.  As  At  gets  close  to  0,  this  vector  points  in  a  direction 
that  is  closer  and  closer  to  the  direction  in  which  the  object  is  moving;  geometrically,  it 
approaches  a  vector  tangent  to  the  path  of  the  object  at  a  particular  point. 


{fit  +  At)     fit),  git  +  At)     git)Jiit  +  At)  hft)) 


Figure  13.2.1     Approximating  the  derivative. 


Unfortunately,  the  vector  Ar  approaches  0  in  length;  the  vector  (0,0,0)  is  not  very 
informative.  By  dividing  by  At,  when  it  is  small,  we  effectively  keep  magnifying  the  length 
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of  Ar  so  that  in  the  hmit  it  doesn't  disappear.  Thus  the  hmiting  vector  {f'{t),g'{t),  h'{t)) 
will  (usually)  be  a  good,  non-zero  vector  that  is  tangent  to  the  curve. 

What  about  the  length  of  this  vector?  It's  nice  that  we've  kept  it  away  from  zero, 
but  what  does  it  measure,  if  anything?  Consider  the  length  of  one  of  the  vectors  that 
approaches  the  tangent  vector: 


r{t  +  At)  -r{t) 


At 


|r(t  + At)  -r(t)| 
\At\ 


The  numerator  is  the  length  of  the  vector  that  points  from  one  position  of  the  object  to  a 
"nearby"  position;  this  length  is  approximately  the  distance  traveled  by  the  object  between 
times  t  and  t  +  At.  Dividing  this  distance  by  the  length  of  time  it  takes  to  travel  that 
distance  gives  the  average  speed.  As  At  approaches  zero,  this  average  speed  approaches 
the  actual,  instantaneous  speed  of  the  object  at  time  t. 

So  by  performing  an  "obvious"  calculation  to  get  something  that  looks  like  the  deriva- 
tive of  r(t),  we  get  precisely  what  we  would  want  from  such  a  derivative:  the  vector  r'(t) 
points  in  the  direction  of  travel  of  the  object  and  its  length  tells  us  the  speed  of  travel.  In 
the  case  that  t  is  time,  then,  we  call  v(t)  =  r'(t)  the  velocity  vector.  Even  if  t  is  not  time, 
r'(t)  is  useful — it  is  a  vector  tangent  to  the  curve. 

EXAMPLE  13.2.1    We  have  seen  that  r  =  (cost,  sint,  t)  is  a  helix.    We  compute 


r'  =  (— sint,  cost,  1),  and  |r'|  =  y  sin^  t  +  cos^  t  +  1  =  -\/2.  So  thinking  of  this  as  a 
description  of  a  moving  object,  its  speed  is  always  a/2;  see  figure  13.2.2.  For  an  animated 
view  of  the  tangent  vectors,  see  the  Java  applet.  □ 


Figure  13.2.2     A  tangent  vector  on  the  helix. 
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EXAMPLE  13.2.2  The  velocity  vector  for  (cos  t,  sin  t,  cos  t)  is  (—  sin  t,  cos  t,  —  sin  t) .  As 
before,  the  first  two  coordinates  mean  that  from  above  this  curve  looks  like  a  circle.  The 
z  coordinate  is  now  also  periodic,  so  that  as  the  object  moves  around  the  curve  its  height 
oscillates  up  and  down.  In  fact  it  turns  out  that  the  curve  is  a  tilted  ellipse,  as  shown  in 
figure  13.2.3.  □ 


Figure  13.2.3     The  ellipse  r  =  (cos  t,  sin    cos  t). 


EXAMPLE  13.2.3  The  velocity  vector  for  (cos  t,  sin  t,  cos  2t)  is  (—  sin  t,  cos  t,  — 2  sin  2t) . 
The  z  coordinate  is  now  oscillating  twice  as  fast  as  in  the  previous  example,  so  the  graph 
is  not  surprising;  see  figure  13.2.4.  □ 


Figure  13.2.4     (cost,  sin t,  cos  2i;). 
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EXAMPLE  13.2.4  Find  the  angle  between  the  curves  {t,l-t,3+t'^)  and  {3-t,t-2,t'^) 
where  they  meet. 

The  angle  between  two  curves  at  a  point  is  the  angle  between  their  tangent  vectors — 
any  tangent  vectors  will  do,  so  we  can  use  the  derivatives.  We  need  to  find  the  point  of 
intersection,  evaluate  the  two  derivatives  there,  and  finally  find  the  angle  between  them. 

To  find  the  point  of  intersection,  we  need  to  solve  the  equations 

t  =  3  —  u 

l-t=u-2 
3  +  ^2  = 

Solving  either  of  the  first  two  equations  for  u  and  substituting  in  the  third  gives  3  + = 
(3  -  which  means  t  =  1.  This  together  with  u  =  2  satisfies  all  three  equations.  Thus 
the  two  curves  meet  at  (1,  0, 4),  the  first  when  t  =  1  and  the  second  when  t  —  2. 

The  derivatives  are  (1,-1, 2t)  and  l,l,2t);  at  the  intersection  point  these  are 
(1,-1,2)  and  (—1, 1, 4).  The  cosine  of  the  angle  between  them  is  then 

„     -1-1  +  8  1 
cos  6^  = 


so  ^  =  arccos(l/\/3)  fti  0.96.  □ 

The  derivatives  of  vector  functions  obey  some  familiar  looking  rules,  which  we  will 
occasionally  need. 

THEOREM  13.2.5  Suppose  r{t)  and  s{t)  are  differentiable  functions,  f{t)  is  a  differ- 
entiable  function,  and  a  is  a  real  number. 

a.  -^ar(t)  =  ar'{t) 

(JiL 

b.  |(r(t)  +  s(t))=r'(t)  +  s'(t) 

c.  |/(t)r(t)  =  /(t)r'(t)  +  /(t)r(t) 

d.  |(r(t).s(t))  =  r'(t).s(t)  +  r(t).s'(t) 

e.  ^(r(t)  X  s{t))  =  r'{t)  x  s{t)  +  r{t)  x  s'{t) 

f.  |r(/(t))  =  r'(/(t))/'(t) 
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Note  that  because  the  cross  product  is  not  commutative  you  must  remember  to  do  the 
three  cross  products  in  formula  (e)  in  the  correct  order. 

When  the  derivative  of  a  function  f(t)  is  zero,  we  know  that  the  function  has  a  hori- 
zontal tangent  line,  and  may  have  a  local  maximum  or  minimum  point.  If  r'(t)  =  0,  the 
geometric  interpretation  is  quite  different,  though  the  interpretation  in  terms  of  motion 
is  similar.  Certainly  we  know  that  the  object  has  speed  zero  at  such  a  point,  and  it  may 
thus  be  abruptly  changing  direction.  In  three  dimensions  there  are  many  ways  to  change 
direction;  geometrically  this  often  means  the  curve  has  a  cusp  or  a  point,  as  in  the  path 
of  a  ball  that  bounces  off  the  floor  or  a  wall. 

EXAMPLE  13.2.6  Suppose  that  r(t)  =  (1  +  1),  so  r'{t)  =  (St^,  2t,  0).  This  is  0 
at  t  =  0,  and  there  is  indeed  a  cusp  at  the  point  (1,  0, 1),  as  shown  in  figure  13.2.5.  □ 


Figure  13.2.5     (1  +  1)  has  a  cusp  at  (1,0, 1). 

Sometimes  we  will  be  interested  in  the  direction  of  r'  but  not  its  length.  In  some 
cases,  we  can  still  work  with  r',  as  when  we  find  the  angle  between  two  curves.  On  other 
occasions  it  will  be  useful  to  work  with  a  unit  vector  in  the  same  direction  as  r';  of  course, 
we  can  compute  such  a  vector  by  dividing  r'  by  its  own  length.  This  standard  unit  tangent 
vector  is  usually  denoted  by  T: 


In  a  sense,  when  we  computed  the  angle  between  two  tangent  vectors  we  have  already 
made  use  of  the  unit  tangent,  since 
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Now  that  we  know  how  to  make  sense  of  r',  we  immediately  know  what  an  antideriva- 
tive  must  be,  namely 


J  r{t)  dt={j  fit)  dt,  J  g{t)  dt,  j  h{t)  dt), 


if  r  =  {f{t),g{t),  h{t)).  What  about  definite  integrals?  Suppose  that  v(t)  gives  the  velocity 
of  an  object  at  time  t.  Then  v(t)At  is  a  vector  that  approximates  the  displacement  of  the 
object  over  the  time  Ai:  v(t)Ai  points  in  the  direction  of  travel,  and  |v(t)At|  =  |v(t)||At| 
is  the  speed  of  the  object  times  At,  which  is  approximately  the  distance  traveled.  Thus,  if 
we  sum  many  such  tiny  vectors: 

n-l 


J]v(t,)At 


i=0 


we  get  an  approximation  to  the  displacement  vector  over  the  time  interval  [to,^n]-  If  we 
take  the  limit  we  get  the  exact  value  of  the  displacement  vector: 


n-l  „t 


v(t)  dt  =  r{t^)  -  r{to) 


Thus,  given  the  velocity  vector  we  can  compute  the  vector  function  r  giving  the  location 
of  the  object: 


r{t)  =  tq  +      v(m)  du. 
Jo 


EXAMPLE  13.2.7  An  object  moves  with  velocity  vector  (cost,  sin t,  cost),  starting  at 
(1,1,1).  Find  the  function  r  giving  its  location. 

r(t)  =  (1, 1, 1)  +  /  (cos  w,  sinu,  cos  w)  (iw 
Jo 

—  (1, 1, 1)  +  (sinw,  —  cos w,  sinu)|Q 

=  (1, 1, 1)  +  (sint,  -  cost,  sint)  -  (0,  -1,  0) 

=  (1  +  sint,  2  —  cost,  1  +  sint) 

See  figure  13.2.6.  □ 
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Figure  13.2.6     Path  of  the  object  with  its  initial  velocity  vector. 

Exercises  13.2. 

1.  Find  r'  and  T  for  r  =  {t^ ,  l,t).  =^ 

2.  Find  r'  and  T  for  r  =  (cosi,sin2t,t^).  =^ 

3.  Find  r'  and  T  for  r  =  (cos(e*),  sin(e*),  sint).  =^ 

4.  Find  a  vector  function  for  the  line  tangent  to  the  helix  (cost,  sint,  t)  when  t  =  n/A. 

5.  Find  a  vector  function  for  the  line  tangent  to  (cost,  sint,  cos4t)  when  t  =  tt/3.  =^ 

6.  Find  the  cosine  of  the  angle  between  the  curves  (0,  t^,t)  and  (cos(7rt/2),  sin(7rt/2),  t)  where 
they  intersect. 

7.  Find  the  cosine  of  the  angle  between  the  curves  (cost,  —  sin(t)/4,  sin  t)  and  (cos  t,  sin  t,  sin(2t)) 
where  they  intersect.  =^ 

8.  Suppose  that  |r(t)|  =  k,  for  some  constant  k.  This  means  that  r  describes  some  path  on 
the  sphere  of  radius  k  with  center  at  the  origin.  Show  that  r  is  perpendicular  to  r'  at  every 
point.  Hint:  Use  Theorem  13.2.5,  part  (d). 

9.  A  bug  is  crawling  along  the  spoke  of  a  wheel  that  lies  along  a  radius  of  the  wheel.  The  bug  is 
crawling  at  1  unit  per  second  and  the  wheel  is  rotating  at  1  radian  per  second.  Suppose  the 
wheel  lies  in  the  y-z  plane  with  center  at  the  origin,  and  at  time  t  =  0  the  spoke  lies  along 
the  positive  y  axis  and  the  bug  is  at  the  origin.  Find  a  vector  function  r(t)  for  the  position 
of  the  bug  at  time  t,  the  velocity  vector  r'(t),  the  unit  tangent  T(t),  and  the  speed  of  the 
bug  |r'(t)|.  ^ 

10.  An  object  moves  with  velocity  vector  (cost,  sint,  t),  starting  at  (0,  0,  0)  when  t  =  0.  Find  the 
function  r  giving  its  location. 

11.  The  position  function  of  a  particle  is  given  by  r(t)  =  (t^5t,t^  -  16t),  t  >  0.  When  is  the 
speed  of  the  particle  a  minimum?  =^> 

12.  A  particle  moves  so  that  its  position  is  given  by  (cos  t,  sin  t,  cos(6t)).  Find  the  maximum  and 
minimum  speeds  of  the  particle.  =^ 

13.  An  object  moves  with  velocity  vector  (t,  t'^,  cos  t),  starting  at  (0,0,0)  when  t  =  0.  Find  the 
function  r  giving  its  location.  =^ 
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14.  What  is  the  physical  interpretation  of  the  dot  product  of  two  vector  valued  functions?  What 
is  the  physical  interpretation  of  the  cross  product  of  two  vector  valued  functions? 

15.  Show,  using  the  rules  of  cross  products  and  differentiation,  that 

|(r(t)xr'(t))  =  r(t)xr"(i). 

16.  Determine  the  point  at  which  f(t)  =  {t,t'^,t^)  and  g(i)  =  (cos(t),  cos(2i),  i  +  1)  intersect,  and 
find  the  angle  between  the  curves  at  that  point.  (Hint:  You'll  need  to  set  this  one  up  like 
a  line  intersection  problem,  writing  one  in  s  and  one  in  t.)  If  these  two  functions  were  the 
trajectories  of  two  airplanes  on  the  same  scale  of  time,  would  the  planes  collide  at  their  point 
of  intersection?  Explain.  =^ 

17.  Find  the  equation  of  the  plane  perpendicular  to  the  curve  r{t)  =  (2  sin(3t),  2  cos(3t))  at  the 
point  (0,  TT,  —2). 

18.  Find  the  equation  of  the  plane  perpendicular  to  (cost,  sin cos(6i))  when  t  =  7r/4.  =^ 

19.  At  what  point  on  the  curve  r{t)  =  {1^,31,1"^)  is  the  plane  perpendicular  to  the  curve  also 
parallel  to  the  plane  Qx  +  6y  —  8z  =  1?  =^> 

20.  Find  the  equation  of  the  line  tangent  to  (cos    sint,  cos(6t))  when  t  =  7r/4.  =^ 


13.3    Arc  length  and  curvature 

Sometimes  it  is  useful  to  compute  the  length  of  a  curve  in  space;  for  example,  if  the  curve 
represents  the  path  of  a  moving  object,  the  length  of  the  curve  between  two  points  may 
be  the  distance  traveled  by  the  object  between  two  times. 

Recall  that  if  the  curve  is  given  by  the  vector  function  r  then  the  vector  Ar  =  r{t  + 
At)  —  r{t)  points  from  one  position  on  the  curve  to  another,  as  depicted  in  figure  13.2.1.  If 
the  points  are  close  together,  the  length  of  Ar  is  close  to  the  length  of  the  curve  between 
the  two  points.  If  we  add  up  the  lengths  of  many  such  tiny  vectors,  placed  head  to  tail 
along  a  segment  of  the  curve,  we  get  an  approximation  to  the  length  of  the  curve  over  that 
segment.  In  the  limit,  as  usual,  this  sum  turns  into  an  integral  that  computes  precisely 
the  length  of  the  curve.  First,  note  that 

|Ar|  =         At  ^  \r'(t)\At, 

II  I    wi  , 

when  At  is  small.  Then  the  length  of  the  curve  between  r(a)  and  r(6)  is 

n-l  n-1  i^^i  n-1  b 

lim  V  lArl  =  lim        ^——^  At  =  lim  y^\r'(t)\At=  /  \r'(t)\dt. 

i=0  i=0  i=0  "^^ 


(Well,  sometimes.  This  works  if  between  a  and  b  the  segment  of  curve  is  traced  out  exactly 
once.) 
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EXAMPLE  13.3.1    Let's  find  the  length  of  one  turn  of  the  hehx  r  =  (cost,  sint,  t)  (see 

figure  13.1.1).  We  compute  r'  =  (— sin t,  cost,  1)  and  |r'|  =  ^J siv?  t  +  cos^  t  +  1  =  a/2,  so 
the  length  is 

r.27r 

V2dt  =  2V2tx. 

□ 

EXAMPLE  13.3.2  Suppose  y  =  Inx;  what  is  the  length  of  this  curve  between  x  =  1 
and  X  =  a/3? 

Although  this  problem  does  not  appear  to  involve  vectors  or  three  dimensions,  we  can 
interpret  it  in  those  terms:  let  r{t)  =  (t,lnt,0).  This  vector  function  traces  out  precisely 
y  =  \nx  in  the  x-y  plane.  Then  r'{t)  =  (1,  l/t,0)  and  |r'(t)|  =  a/1  +  1/t^  and  the  desired 
length  is 


/ I  1 
^^l  +  :^dt^2-V2  +  ln{V2  +  1)  -  2 


In  3. 

2 

(This  integral  is  a  bit  tricky,  but  requires  only  methods  we  have  learned.)  □ 

Notice  that  there  is  nothing  special  about  y  =  lnx,  except  that  the  resulting  integral 
can  be  computed.  In  general,  given  any  y  =  /(x),  we  can  think  of  this  as  the  vector 
function  r{t)  =  (t,/(t),0).  Then  r'{t)  =  (l,/'(t),0)  and  |r'(t)|  =  ^/l  +  {ff.  The  length 
of  the  curve  y  =  f{x)  between  a  and  b  is  thus 


/    Vl  +  (/'(x))2rfx. 

J  a 


Unfortunately,  such  integrals  are  often  impossible  to  do  exactly  and  must  be  approximated. 

One  useful  application  of  arc  length  is  the  arc  length  parameterization.  A  vector 
function  r(t)  gives  the  position  of  a  point  in  terms  of  the  parameter  t,  which  is  often  time, 
but  need  not  be.  Suppose  s  is  the  distance  along  the  curve  from  some  fixed  starting  point; 
if  we  use  s  for  the  variable,  we  get  r(s),  the  position  in  space  in  terms  of  distance  along  the 
curve.  We  might  still  imagine  that  the  curve  represents  the  position  of  a  moving  object; 
now  we  get  the  position  of  the  object  as  a  function  of  how  far  the  object  has  traveled. 

EXAMPLE  13.3.3  Suppose  r(t)  =  (cost,  sin t,  0).  We  know  that  this  curve  is  a  circle 
of  radius  1.  While  t  might  represent  time,  it  can  also  in  this  case  represent  the  usual 
angle  between  the  positive  x-axis  and  r(t).  The  distance  along  the  circle  from  (1,0,0)  to 
(cost,  sin t,  0)  is  also  t — this  is  the  definition  of  radian  measure.  Thus,  in  this  case  s  =  t 
and  r(s)  =  (cos  s,  sin  s,  0).  □ 
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EXAMPLE  13.3.4  Suppose  r{t)  =  (cos  t,  sin  t,  t) .  We  know  that  this  curve  is  a  helix. 
The  distance  along  the  helix  from  (1,0,0)  to  (cos  t,  sin  t,  t)  is 

s  =  /   \r'{u)\du=  /  V  cos^  u  +  sin^  u+ldu=  /  V2du  =  V2t. 
Jo  Jo  Jo 

Thus,  the  value  of  t  that  gets  us  distance  s  along  the  helix  is  t  =  s/\/2,  and  so  the  same 
curve  is  given  by  r(s)  =  (cos(s/v^),  sin(s/v/2),  s/-\/2).  □ 

In  general,  if  we  have  a  vector  function  r{t),  to  convert  it  to  a  vector  function  in  terms 
of  arc  length  we  compute 

s=  f  \v'{u)\du  =  f{t), 

J  a 

solve  s  =  f{t)  for  t,  getting  t  =  g{s),  and  substitute  this  back  into  r{t)  to  get  f  (s)  =  r{g{s)). 

Suppose  that  t  is  time.  By  the  Fundamental  Theorem  of  Calculus,  if  we  start  with  arc 
length 

s{t)  =  I  \r'{u)\du 

J  a 

and  take  the  derivative,  we  get 

s'{t)  =  \v'{t)\. 

Here  s'{t)  is  the  rate  at  which  the  arc  length  is  changing,  and  we  have  seen  that  \r'{t)\  is 
the  speed  of  a  moving  object;  these  are  of  course  the  same. 

Suppose  that  y{s)  is  given  in  terms  of  arc  length;  what  is  |r'(s)|?  It  is  the  rate  at  which 
arc  length  is  changing  relative  to  arc  length]  it  must  be  1!  In  the  case  of  the  helix,  for 
example,  the  arc  length  parameterization  is  (cos(s/\/2),  sin(s/v^),  s/-\/2),  the  derivative 
is  (-sin(s/V2)/V2,cos(s/V2)/V2,l/V2),  and  the  length  of  this  is 

^^.'^.\-^,-. 

So  in  general,  r'  is  a  unit  tangent  vector. 

Given  a  curve  r(t),  we  would  like  to  be  able  to  measure,  at  various  points,  how  sharply 
curved  it  is.  Clearly  this  is  related  to  how  "fast"  a  tangent  vector  is  changing  direction,  so 
a  first  guess  might  be  that  we  can  measure  curvature  with  |r"(t)|.  A  little  thought  shows 
that  this  is  flawed;  if  we  think  of  t  as  time,  for  example,  we  could  be  tracing  out  the  curve 
more  or  less  quickly  as  time  passes.  The  second  derivative  |r"(t)|  incorporates  this  notion 
of  time,  so  it  depends  not  simply  on  the  geometric  properties  of  the  curve  but  on  how 
quickly  we  move  along  the  curve. 
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EXAMPLE  13.3.5  Consider  r{t)  =  (cos  t,  sin  t,  0)  and  s{t)  =  (cos2t,sin2t,  0).  Both 
of  these  vector  functions  represent  the  unit  circle  in  the  x-y  plane,  but  if  t  is  interpreted 
as  time,  the  second  describes  an  object  moving  twice  as  fast  as  the  first.  Computing  the 
second  derivatives,  we  find  |r"(t)|  =  1,  |s"(t)|  =4.  □ 

To  remove  the  dependence  on  time,  we  use  the  arc  length  parameterization.  If  a  curve 
is  given  by  r(s),  then  the  first  derivative  r'(s)  is  a  unit  vector,  that  is,  r'(s)  =  T(s).  We 
now  compute  the  second  derivative  r"{s)  =  T'(s)  and  use  |T'(s)|  as  the  "official"  measure 
of  curvature,  usually  denoted  n. 

EXAMPLE  13.3.6  We  have  seen  that  the  arc  length  parameterization  of  a  particu- 
lar helix  is  r(s)  =  (cos(s/-\/2),  sin(s/v^),  s/\/2).  Computing  the  second  derivative  gives 
r"{s)  =  (-cos(s/\/2)/2,  -sin(s/\/2)/2,0)  with  length  1/2.  □ 

What  if  we  are  given  a  curve  as  a  vector  function  r(t),  where  t  is  not  arc  length? 
We  have  seen  that  arc  length  can  be  difficult  to  compute;  fortunately,  we  do  not  need  to 
convert  to  the  arc  length  parameterization  to  compute  curvature.  Instead,  let  us  imagine 
that  we  have  done  this,  so  we  have  found  t  =  g{s)  and  then  formed  r{s)  =  r{g{s)).  The 
first  derivative  f'(s)  is  a  unit  tangent  vector,  so  it  is  the  same  as  the  unit  tangent  vector 
T{t)  —  T{g{s)).  Taking  the  derivative  of  this  we  get 

^T{g{s))  =  T'{g{s))g'{s)  =  T'{t)f^. 

The  curvature  is  the  length  of  this  vector: 

dt.      \T'{t)\  \T'{t)\ 


K=\T'{t)\\ 


ds'     \ds/dt\  \r'{t)\ 


(Recall  that  we  have  seen  that  ds/dt  =  |r'(t)|.)  Thus  we  can  compute  the  curvature  by 
computing  only  derivatives  with  respect  to  t;  we  do  not  need  to  do  the  conversion  to  arc 
length. 

EXAMPLE  13.3.7  Returning  to  the  helix,  suppose  we  start  with  the  parameteri- 
zation r{t)  =  (cost, sin t, t).  Then  r'{t)  —  (— sin cost,  1),  \r'{t)\  =  \/2,  and  T{t)  = 
{-smt,cost,l)/V2.  Then  T'{t)  =  (- cost,  -  sin t,  0)7^2  and  |T'(t)|  =  1/^2.  Finally, 
K  =  I/V2/V2  =  1/2,  as  before.  □ 

EXAMPLE  13.3.8  Consider  this  circle  of  radius  a:  r{t)  =  (acost,  asint,  1).  Then 
^'(^)  =  (— asint,  ocost,  0),  |r'(t)|  =  a,  and  T(t)  =  (— asint,  acost,  0)/a.  Now  T'(t)  = 
(— acost,  — asint,  0)/a  and  |T'(t)|  =  1.   Finally,  n  =  1/a:  the  curvature  of  a  circle  is 
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everywhere  the  inverse  of  the  radius.  It  is  sometimes  useful  to  think  of  curvature  as 
describing  what  circle  a  curve  most  resembles  at  a  point.  The  curvature  of  the  helix  in  the 
previous  example  is  1/2;  this  means  that  a  small  piece  of  the  helix  looks  very  much  like  a 
circle  of  radius  2,  as  shown  in  figure  13.3.1.  □ 


Figure  13.3.1     A  circle  with  the  same  curvature  as  the  helix. 
EXAMPLE  13.3.9    Consider  r(t)  =  (cost,  sint,  cos2t),  as  shown  in  figure  13.2.4.  r'{t) 


(-sint,cost, -2sin(2t))  and  |r'(t)|  =  -^1  +  4 sin^(2t),  so 

,      ,  sint  cost  — 2sin2t 

T{t) 


l+4sin^(2t)   ^l  +  4sin^(2t)   ^l  +  4sin^(2t) 

Computing  the  derivative  of  this  and  then  the  length  of  the  resulting  vector  is  possible  but 
unpleasant.  □ 

Fortunately,  there  is  an  alternate  formula  for  the  curvature  that  is  often  simpler  than 
the  one  we  have: 

\r'(t)xr"{t)\ 


K 


EXAMPLE  13.3.10  Returning  to  the  previous  example,  we  compute  the  second  deriva- 
tive r"(t)  =  (—cost,  —  sint,  — 4cos(2t)).  Then  the  cross  product  r'(t)  x  r"(t)  is 


(—4  cos  t  cos  2t  —  2  sin  t  sin  2t,  2  cos  t  sin  2t  —  4  sin  t  cos  2t,  1) . 
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Computing  the  length  of  this  vector  and  dividing  by  |r'(t)|^  is  stiU  a  bit  tedious.  With  the 
aid  of  a  computer  we  get 


K 


V48  cos4t-48  cos2t +  17 
(-16cos4t  +  16  cos2t  +  1)3/2 


Graphing  this  we  get 


2 


27r 


Compare  this  to  figure  13.2.4 — you  may  want  to  load  the  Java  applet  there  so  that  you 
can  see  it  from  different  angles.  The  highest  curvature  occurs  where  the  curve  has  its 
highest  and  lowest  points,  and  indeed  in  the  picture  these  appear  to  be  the  most  sharply 
curved  portions  of  the  curve,  while  the  curve  is  almost  a  straight  line  midway  between 
those  points.  □ 

Let's  see  why  this  alternate  formula  is  correct.  Starting  with  the  definition  of  T, 
r'  =  |r'|T  so  by  the  product  rule  r"  =  |r'|'T  +  |r'|T'.  Then  by  Theorem  12.4.1  the  cross 
product  is 

r  X  r"  =  \r'\T  x  |r'|'T  +  |r'|T  x  |r'|T' 
=  |r'||r'|'(T  X  T)  +  |r'p(T  x  T') 
=  |r'p(T  X  T') 
because  T  x  T  =  0,  since  T  is  parallel  to  itself.  Then 

|r'  X  r"|  =  |r'|2|T  x  T'| 

=  |r'|2|T||T'|sin^ 
=  |r?|T'| 

using  exercise  8  in  section  13.2  to  see  that  9  —  tv/2.  Dividing  both  sides  by  |r'p  then  gives 
the  desired  formula. 

We  used  the  fact  here  that  T'  is  perpendicular  to  T;  the  vector  N  =  T'/|T'|  is  thus  a 
unit  vector  perpendicular  to  T,  called  the  unit  normal  to  the  curve.  Occasionally  of  use 
is  the  unit  binormal  B  =  T  x  N,  a  unit  vector  perpendicular  to  both  T  and  N. 
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Exercises  13.3. 

1.  Find  the  length  of  (Scosi,  2t,  3sint),  t  G  [0,27r].  =^ 

2.  Find  the  length  of  {t^,2,t^),  te[0,l].  ^ 

3.  Find  the  length  of  (t^,  sin t,  cost),  t  G  [0, 1].  =^ 

4.  Find  the  length  of  the  curve  y  =  x^^"^ ,  a;  G  [1,  9].  =^ 

5.  Set  up  an  integral  to  compute  the  length  of  (cost,  sin t,  e*),  t  G  [0,5].  (It  is  tedious  but  not 
too  difficult  to  compute  this  integral.)  =^ 

6.  Find  the  curvature  of  {t,t'^,t). 

7.  Find  the  curvature  of  {t,t'^,t'^).  =^ 

8.  Find  the  curvature  of  {t,t'^,t^).  =^ 

9.  Find  the  curvature  of  y  =      at  (1, 1).  =^ 

13.4    Motion  along  a  curve 

We  have  already  seen  that  if  t  is  time  and  an  object's  location  is  given  by  r(t),  then  the 
derivative  r'(t)  is  the  velocity  vector  v(t).  Just  as  v(t)  is  a  vector  describing  how  r(t) 
changes,  so  is  v'(t)  a  vector  describing  how  v(t)  changes,  namely,  a(t)  =  v'(t)  =  r"(t)  is 
the  acceleration  vector. 

EXAMPLE  13.4.1  Suppose  r{t)  =  (cost,  sin t,  1).  Then  v(t)  =  (- sin t,  cost,  0)  and 
a(t)  =  (—  cost,  —  sint,  0).  This  describes  the  motion  of  an  object  traveling  on  a  circle  of 
radius  1,  with  constant  z  coordinate  1.  The  velocity  vector  is  of  course  tangent  to  the 
curve;  note  that  a  ■  v  =  0,  so  v  and  a  are  perpendicular.  In  fact,  it  is  not  hard  to  see  that 
a  points  from  the  location  of  the  object  to  the  center  of  the  circular  path  at  (0,  0, 1).  □ 

Recall  that  the  unit  tangent  vector  is  given  by  T(t)  =  v(t)/|v(t)|,  so  v  =  |v|T.  If  we 
take  the  derivative  of  both  sides  of  this  equation  we  get 

a=  |v|'T+ |v|T'.  (13.4.1) 

Also  recall  the  definition  of  the  curvature,  k  =  |T'|/|v|,  or  |T'|  =  k|v|.  Finally,  recall  that 
we  defined  the  unit  normal  vector  as  N  =  T'/|T'|,  so  T'  =  |T'|N  =  k|v|N.  Substituting 
into  equation  13.4.1  we  get 

a=  |v|'T  +  K|vpN.  (13.4.2) 

The  quantity  |v(t)|  is  the  speed  of  the  object,  often  written  as  v{t);  |v(t)|'  is  the  rate 
at  which  the  speed  is  changing,  or  the  scalar  acceleration  of  the  object,  a(t).  Rewriting 
equation  13.4.2  with  these  gives  us 

a  =  aT  +  Kv'^N  =  arT  +  ajvN; 

ttT  is  the  tangential  component  of  acceleration  and  oat  is  the  normal  component 
of  acceleration.  We  have  already  seen  that  ar  measures  how  the  speed  is  changing;  if 
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you  are  riding  in  a  vehicle  with  large  gt  you  will  feel  a  force  pulling  you  into  your  seat. 
The  other  component,  a^,  measures  how  sharply  your  direction  is  changing  with  respect 
to  time.  So  it  naturally  is  related  to  how  sharply  the  path  is  curved,  measured  by  k,,  and 
also  to  how  fast  you  are  going.  Because  aN  includes  f^,  note  that  the  effect  of  speed  is 
magnified;  doubling  your  speed  around  a  curve  quadruples  the  value  of  apj.  You  feel  the 
effect  of  this  as  a  force  pushing  you  toward  the  outside  of  the  curve,  the  "centrifugal  force." 
In  practice,  if  want  a  at  we  would  use  the  formula  for  k: 


ttN  =  /t|V 


|r' 

X 

r"| 

|r'| 

3 

To  compute      we  can  project  a  onto  v: 

V  ■  a      r'  ■  r' 


EXAMPLE  13.4.2     Suppose  r  =  Compute  v,  a,  ar,  and  gn- 

Taking  derivatives  we  get  v  =  (1,  2t,  3t^)  and  a  =  (0,  2,  6t).  Then 

4t  +  18t^  ,  V4  +  36^2  +  36t4 

ttT  =    ,  and    otv  =  — , 

Vl  +  4^2  +  9^4  Vl  +  4^2  +  9^4 

□ 

Exercises  13.4- 

1.  Let  r  =  (cost, sin Compute  v,  a,  ax,  and  aw-  =^ 

2.  Let  r  =  (cost, sin t,t^).  Compute  v,  a,  ct,  and  aN.  =^ 

3.  Let  r  =  (cos  t,  sint,  e*).  Compute  v,  a,  ar,  and  oat. 

4.  Let  r  =  (e*,  sint,  e*).  Compute  v,  a,  ar,  and  cat. 

5.  Suppose  an  object  moves  so  that  its  acceleration  is  given  by  a  =  (— 3cost,  — 2sint,  0).  At 
time  t  =  0  the  object  is  at  (3,  0,  0)  and  its  velocity  vector  is  (0,  2,  0).  Find  v(t)  and  r{t)  for 
the  object.  =^> 

6.  Suppose  an  object  moves  so  that  its  acceleration  is  given  by  a  =  (— 3  cost,  — 2 sint,  0).  At 
time  t  =  0  the  object  is  at  (3,  0,  0)  and  its  velocity  vector  is  (0,  2.1,  0).  Find  v(t)  and  r(t)  for 
the  object.  =^ 

7.  Suppose  an  object  moves  so  that  its  acceleration  is  given  by  a  =  (— 3  cost,  — 2 sint,  0).  At 
time  t  =  0  the  object  is  at  (3,  0,  0)  and  its  velocity  vector  is  (0,  2, 1).  Find  v(t)  and  r(t)  for 
the  object.  =^> 

8.  Suppose  an  object  moves  so  that  its  acceleration  is  given  by  a  =  (— 3cost,  — 2sint,  0).  At 
time  t  =  0  the  object  is  at  (3,  0, 0)  and  its  velocity  vector  is  (0,  2.1, 1).  Find  v(t)  and  r(t)  for 
the  object.  =^> 
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9.  Describe  a  situation  in  which  the  normal  component  of  acceleration  is  0  and  the  tangential 
component  of  acceleration  is  non-zero.  Is  it  possible  for  the  tangential  component  of  accel- 
eration to  be  0  while  the  normal  component  of  acceleration  is  non-zero?  Explain.  Finally,  is 
it  possible  for  an  object  to  move  (not  be  stationary)  so  that  both  the  tangential  and  normal 
components  of  acceleration  are  0?  Explain. 


14 

Partial  Differentiation 


14.1    Functions  of  Several  Variables 

In  single-variable  calculus  we  were  concerned  with  functions  that  map  the  real  numbers  R 
to  R,  sometimes  called  "real  functions  of  one  variable" ,  meaning  the  "input"  is  a  single  real 
number  and  the  "output"  is  likewise  a  single  real  number.  In  the  last  chapter  we  considered 
functions  taking  a  real  number  to  a  vector,  which  may  also  be  viewed  as  functions  /:  R  — > 
R^,  that  is,  for  each  input  value  we  get  a  position  in  space.  Now  we  turn  to  functions 
of  several  variables,  meaning  several  input  variables,  functions  /:  R"^  — ?>  R.  We  will  deal 
primarily  with  n  =  2  and  to  a  lesser  extent  n  =  3;  in  fact  many  of  the  techniques  we 
discuss  can  be  applied  to  larger  values  of  n  as  well. 

A  function  /:  R^  — )■  R  maps  a  pair  of  values  (x,  y)  to  a  single  real  number.  The  three- 
dimensional  coordinate  system  we  have  already  used  is  a  convenient  way  to  visualize  such 
functions:  above  each  point  {x,y)  in  the  x-y  plane  we  graph  the  point  {x,y,z),  where  of 
course  z  =  f{x,  y). 

EXAMPLE  14.1.1  Consider  f{x,  y)  =  2>x  +  Ay- 5.  Writing  this  as  2;  =  3a;  -h  4y  -  5  and 
then  Sx  +  Ay  —  z  =  5  we  recognize  the  equation  of  a  plane.  In  the  form  f{x,  y)  =  3x-|-4j/  — 5 
the  emphasis  has  shifted:  we  now  think  of  x  and  y  as  independent  variables  and  2;  as  a 
variable  dependent  on  them,  but  the  geometry  is  unchanged.  □ 

example  14.1.2  We  have  seen  that  -\- y"^  z"^  =4  represents  a  sphere  of  radius  2. 
We  cannot  write  this  in  the  form  f[x,y),  since  for  each  x  and  y  in  the  disk  x'^  +y'^  <  4  there 
are  two  corresponding  points  on  the  sphere.  As  with  the  equation  of  a  circle,  we  can  resolve 
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this  equation  into  two  functions,  f{x,y)  =  a/4  —  —  and  f{x,y)  =  — a/4  —  —  y^, 
representing  the  upper  and  lower  hemispheres.  Each  of  these  is  an  example  of  a  function 
with  a  restricted  domain:  only  certain  values  of  x  and  y  make  sense  (namely,  those  for 
which  x'^  +  y'^  <  4)  and  the  graphs  of  these  functions  are  limited  to  a  small  region  of  the 
plane.  □ 

EXAMPLE  14.1.3  Consider  /  =  ^/x  +  ^/y.  This  function  is  defined  only  when  both 
X  and  y  are  non-negative.  When  y  =  0  we  get  f{x,y)  =  y/x,  the  familiar  square  root 
function  in  the  x-z  plane,  and  when  x  =  0  we  get  the  same  curve  in  the  y-z  plane. 
Generally  speaking,  we  see  that  starting  from  /(O,  0)  =  0  this  function  gets  larger  in  every 
direction  in  roughly  the  same  way  that  the  square  root  function  gets  larger.  For  example, 
if  we  restrict  attention  to  the  line  x  =  y,  we  get  /(x,  y)  =  2i/x  and  along  the  line  y  =  2x 
we  have  /(x,  y)  =  y/x  +  a/2x  =  (1  +  a/2)a/x.  □ 


Figure  14.1.1     f{x,y)  =  ^+^iAP) 

A  computer  program  that  plots  such  surfaces  can  be  very  useful,  as  it  is  often  difficult 
to  get  a  good  idea  of  what  they  look  like.  Still,  it  is  valuable  to  be  able  to  visualize 
relatively  simple  surfaces  without  such  aids.  As  in  the  previous  example,  it  is  often  a  good 
idea  to  examine  the  function  on  restricted  subsets  of  the  plane,  especially  lines.  It  can  also 
be  useful  to  identify  those  points  (x,  y)  that  share  a  common  z- value. 

EXAMPLE  14.1.4  Consider  f{x,y)  =  x^  +  y'^.  When  x  =  0  this  becomes  f  =  y'^,  a 
parabola  in  the  y-z  plane;  when  y  =  0  we  get  the  "same"  parabola  /  =  x^  in  the  x-z  plane. 
Now  consider  the  line  y  =  kx.  If  we  simply  replace  y  by  kx  we  get  f{x,y)  =  (1  -f  /c^)x^ 
which  is  a  parabola,  but  it  does  not  really  "represent"  the  cross-section  along  y  =  kx, 
because  the  cross-section  has  the  line  y  =  kx  where  the  horizontal  axis  should  be.  In 
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order  to  pretend  that  this  hne  is  the  horizontal  axis,  we  need  to  write  the  function  in 
terms  of  the  distance  from  the  origin,  which  is  a/x^  +  =  a/x^  +  A;^x^.  Now  f{x,y)  = 
+  /c^x^  =  (a/x^  +  k'^x'^Y .  So  the  cross-section  is  the  "same"  parabola  as  in  the  x-z  and 
y-z  planes,  namely,  the  height  is  always  the  distance  from  the  origin  squared.  This  means 
that  f{x,y)  =  x^  +  can  be  formed  by  starting  with  z  =  x'^  and  rotating  this  curve 
around  the  z  axis. 

Finally,  picking  a  value  z  =  k,  ai  what  points  does  f{x,y)  =  kl  This  means  x^+y^  =  /c, 
which  we  recognize  as  the  equation  of  a  circle  of  radius  \/k.  So  the  graph  of  f{x,y)  has 
parabolic  cross-sections,  and  the  same  height  everywhere  on  concentric  circles  with  center 
at  the  origin.  This  fits  with  what  we  have  already  discovered.  □ 


Figure  14.1.2     f{x,  y)  =  x'^  +  (AP) 

As  in  this  example,  the  points  (x,  y)  such  that  /(x,  y)  =  k  usually  form  a  curve,  called 
a  level  curve  of  the  function.  A  graph  of  some  level  curves  can  give  a  good  idea  of  the 
shape  of  the  surface;  it  looks  much  like  a  topographic  map  of  the  surface.  In  figure  14.1.2 
both  the  surface  and  its  associated  level  curves  are  shown.  Note  that,  as  with  a  topographic 
map,  the  heights  corresponding  to  the  level  curves  are  evenly  spaced,  so  that  where  curves 
are  closer  together  the  surface  is  steeper. 

Functions  /:]R"  — )■  M  behave  much  like  functions  of  two  variables;  we  will  on  occasion 
discuss  functions  of  three  variables.  The  principal  difficulty  with  such  functions  is  visual- 
izing them,  as  they  do  not  "fit"  in  the  three  dimensions  we  are  familiar  with.  For  three 
variables  there  are  various  ways  to  interpret  functions  that  make  them  easier  to  under- 
stand. For  example,  /(x,  y,  z)  could  represent  the  temperature  at  the  point  (x,  y,  z),  or  the 
pressure,  or  the  strength  of  a  magnetic  field.  It  remains  useful  to  consider  those  points  at 
which  /(x,  y,  z)  =  /c,  where  k  is  some  constant  value.  If  /(x,  y,  z)  is  temperature,  the  set  of 
points  (x,  y,  z)  such  that  /(x,  y,z)  =  k  is  the  collection  of  points  in  space  with  temperature 
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k;  in  general  this  is  called  a  level  set;  for  three  variables,  a  level  set  is  typically  a  surface, 
called  a  level  surface. 

EXAMPLE  14.1.5  Suppose  the  temperature  at  {x,y,z)  is  T{x,y,z)  =  e~('^^+^^+^^). 
This  function  has  a  maximum  value  of  1  at  the  origin,  and  tends  to  0  in  all  directions. 
If  k  is  positive  and  at  most  1,  the  set  of  points  for  which  T{x,y,z)  =  k  is  those  points 
satisfying  +  +  =  —  In  /c,  a  sphere  centered  at  the  origin.  The  level  surfaces  are  the 
concentric  spheres  centered  at  the  origin.  □ 

Exercises  14- 1- 

1.  Let  /(x,  y)  =  {x  —  y)^.  Determine  the  equations  and  shapes  of  the  cross-sections  when  x  =  0, 
y  =  0,  X  =  y,  and  describe  the  level  curves.  Use  a  three-dimensional  graphing  tool  to  graph 
the  surface. 

2.  Let  f{x,  y)  =  \x\  +  \y\.  Determine  the  equations  and  shapes  of  the  cross-sections  when  x  =  0, 
y  =  0,  X  =  y,  and  describe  the  level  curves.  Use  a  three-dimensional  graphing  tool  to  graph 
the  surface.  =^> 

3.  Let  /(x,  y)  =  e^^^  ^  sin(x'^+y'^).  Determine  the  equations  and  shapes  of  the  cross-sections 
when  X  =  0,  y  =  0,  X  =  y,  and  describe  the  level  curves.  Use  a  three-dimensional  graphing 
tool  to  graph  the  surface.  =^> 

4.  Let  /(x,  y)  =  sin(x  —  y).  Determine  the  equations  and  shapes  of  the  cross-sections  when 
X  =  0,  y  =  0,  X  =  y,  and  describe  the  level  curves.  Use  a  three-dimensional  graphing  tool  to 
graph  the  surface.  =^ 

5.  Let  /(x,y)  =  (x^  —  y^)^.  Determine  the  equations  and  shapes  of  the  cross-sections  when 
X  =  0,  y  =  0,  X  =  y,  and  describe  the  level  curves.  Use  a  three-dimensional  graphing  tool  to 
graph  the  surface.  =^> 

6.  Find  the  domain  of  each  of  the  following  functions  of  two  variables: 

a.  ^/9  -  x'^  +  \/y2  -  4 

b.  arcsin(x^  +  y'^  —  2) 

c.  ^16  —  x^  —  4y2 

7.  Below  are  two  sets  of  level  curves.  One  is  for  a  cone,  one  is  for  a  paraboloid.  Which  is  which? 
Explain. 
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14.2    Limits  and  Continuity 

To  develop  calculus  for  functions  of  one  variable,  we  needed  to  make  sense  of  the  concept  of 
a  limit,  which  we  needed  to  understand  continuous  functions  and  to  define  the  derivative. 
Limits  involving  functions  of  two  variables  can  be  considerably  more  difficult  to  deal  with; 
fortunately,  most  of  the  functions  we  encounter  are  fairly  easy  to  understand. 

The  potential  difficulty  is  largely  due  to  the  fact  that  there  are  many  ways  to  "ap- 
proach" a  point  in  the  x-y  plane.  If  we  want  to  say  that      lim      /(x,  y)  =  L,  we  need  to 

capture  the  idea  that  as  (x,  y)  gets  close  to  (a,  h)  then  /(x,  y)  gets  close  to  L.  For  functions 
of  one  variable,  /(x),  there  are  only  two  ways  that  x  can  approach  a:  from  the  left  or  right. 
But  there  are  an  infinite  number  of  ways  to  approach  (a,  6):  along  any  one  of  an  infinite 
number  of  lines,  or  an  infinite  number  of  parabolas,  or  an  infinite  number  of  sine  curves, 
and  so  on.  We  might  hope  that  it's  really  not  so  bad — suppose,  for  example,  that  along 
every  possible  line  through  (a,  h)  the  value  of  /(x,  y)  gets  close  to  L;  surely  this  means  that 
''f{x,y)  approaches  L  as  (x,y)  approaches  (a,  6)".  Sadly,  no. 

EXAMPLE  14.2.1  Consider  /(x,  y)  =  xy^/{x^  +  y^).  When  x  =  0  or  y  =  0,  /(x,  y)  is 
0,  so  the  limit  of  f{x,y)  approaching  the  origin  along  either  the  x  or  y  axis  is  0.  Moreover, 
along  the  line  y  =  mx,  f{x,y)  =  m^x^/(x^  +  m'*x^).  As  x  approaches  0  this  expression 
approaches  0  as  well.  So  along  every  line  through  the  origin  f{x,y)  approaches  0.  Now 
suppose  we  approach  the  origin  along  x  —  y^.  Then 


y^y^         y4  i 


f{x,y) 


y^  +  y^      2y4  2 


so  the  limit  is  1/2.  Looking  at  figure  14.2.1,  it  is  apparent  that  there  is  a  ridge  above 
X  =  y"^.  Approaching  the  origin  along  a  straight  line,  we  go  over  the  ridge  and  then  drop 
down  toward  0,  but  approaching  along  the  ridge  the  height  is  a  constant  1/2.  □ 

Fortunately,  we  can  define  the  concept  of  limit  without  needing  to  specify  how  a 
particular  point  is  approached — indeed,  in  definition  2.3.2,  we  didn't  need  the  concept  of 
"approach."  Roughly,  that  definition  says  that  when  x  is  close  to  a  then  /(x)  is  close  to 
L;  there  is  no  mention  of  "how"  we  get  close  to  a.  We  can  adapt  that  definition  to  two 
variables  quite  easily: 

DEFINITION  14.2.2    Limit      Suppose  /(x,y)  is  a  function.  We  say  that 

lim      f{x,y)  =  L 
(^>y)^(«,o) 

if  for  every  e  >  0  there  is  a  5  >  0  so  that  whenever  0  <  a/(x  —  a)^  +  {y  —  bY  <  S, 
\f{x,y)-L\<e.  □ 
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Figure  14.2.1     f{x,y)  =  (AP) 

This  says  that  we  can  make  \f{x,y)  —  L\  <  e,  no  matter  how  small  e  is,  by  making  the 
distance  from  (x,  y)  to  (a,  b)  "small  enough" . 


EXAMPLE  14.2.3    We  show  that  lim 


3x^y 


(x,y)^(0,0)        +  y 


0.  Suppose  e  >  0.  Then 


3x^y 


rj.2    _|_  y2 


x" 


x^  +  y 


Note  that  x^/(x^  +  y^)  <  1  and  \y\  =  >/y^  <  a/x^  +  y2  <  s.  So 


X" 


y.2  y2 


3|2/|  <  l-3-(5. 
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We  want  to  force  this  to  be  less  than  e  by  picking  5  "small  enough."  If  we  choose  6  =  e/3 
then 


3x  y 


<  1  -3 


□ 


Recall  that  a  function  /(x)  is  continuous  at  x  =  a  if  lim  /(x)  =  /(a);  roughly  this 

a;— >a 

says  that  there  is  no  "hole"  or  "jump"  at  x  =  a.  We  can  say  exactly  the  same  thing  about 
a  function  of  two  variables:  /(x,y)  is  continuous  at  (a,  6)  if      lim      f{x,y)  =  f{a,b). 


EXAMPLE  14.2.4  The  function  f{x,y)  =  3x'^y/{x'^  +  y'^)  is  not  continuous  at  (0,0), 
because  /(0,0)  is  not  defined.  However,  we  know  that      lim      f{x,y)  =  0,  so  we  can 

(x,y)^(0,0) 

easily  "fix"  the  problem,  by  extending  the  definition  of  /  so  that  /(O,  0)  =  0.  This  surface 
is  shown  in  figure  14.2.2.  □ 


Figure  14.2.2     f{x,y)  =  (AP) 

Note  that  in  contrast  to  this  example  we  cannot  fix  example  14.2.1  at  (0,0)  because 
the  limit  does  not  exist.  No  matter  what  value  we  try  to  assign  to  /  at  (0,  0)  the  surface 
will  have  a  "jump"  there. 
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Fortunately,  the  functions  we  will  examine  will  typically  be  continuous  almost  ev- 
erywhere. Usually  this  follows  easily  from  the  fact  that  closely  related  functions  of  one 
variable  are  continuous.  As  with  single  variable  functions,  two  classes  of  common  functions 
are  particularly  useful  and  easy  to  describe.  A  polynomial  in  two  variables  is  a  sum  of 
terms  of  the  form  ax'^y^,  where  a  is  a  real  number  and  m  and  n  are  non- negative  integers. 
A  rational  function  is  a  quotient  of  polynomials. 

THEOREM  14.2.5  Polynomials  are  continuous  everywhere.  Rational  functions  are 
continuous  everywhere  they  are  defined.  ■ 

Exercises  14-2. 

Determine  whether  each  hmit  exists.  If  it  does,  find  the  hmit  and  prove  that  it  is  the  hmit;  if  it 
does  not,  explain  how  you  know. 

1.  lim 


2. 
3. 

4. 

5. 
6. 

7. 

8. 

9. 

10. 
11. 

12. 


^,y)^(0,0)  X2  +  y2 

hm  — — — - 
xy 


Um 


x,y)^{0,0)  2x2  +  y2 
4  4 

X  —  y 
lim  — — — -  =^ 

x,y)^{ofi)  x^  +  y2 

sin(a;^+y^) 

lim   V- — ^ 

x,y)^{ofi)     x^  +  y^ 

xy 


hm 


x,y)^{0,0)  ^2x2  +  y2 
2  2 

e-"  -y  -  ] 

hm 


x,y)^{Ofi)  x2+y2 

hm  — — — -  =^ 

x,y)^{Ofi)  x2  +  y^ 

+  sin^  y 
lim        ^  „   =^ 

x,y)^{0,0)    2x2 +y2 

(x  —  1)'^  Inx 


lim 


x,y)^{l,0)  (x  -  1)2  +y2 

hm       3x  +  4w  =^ 

x,y)^{l-l) 

hm        ^^'^  - 


;,y)^(0,0)  x2  +  J/2 


X  —  J/ 

13.  Does  the  function  f{x,y)  =   have  any  discontinuities?    What  about  f{x,y) 

1  ~1~  X  ~\~  y 

X  —  y      „  ^    ,  . 
— — — — -1  Explam. 
1  +  x^  + 
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14.3    Partial  Differentiation 

When  we  first  considered  what  the  derivative  of  a  vector  function  might  mean,  there  was 
reaUy  not  much  difficulty  in  understanding  either  how  such  a  thing  might  be  computed  or 
what  it  might  measure.  In  the  case  of  functions  of  two  variables,  things  are  a  bit  harder 
to  understand.  If  we  think  of  a  function  of  two  variables  in  terms  of  its  graph,  a  surface, 
there  is  a  more-or-less  obvious  derivative-like  question  we  might  ask,  namely,  how  "steep" 
is  the  surface.  But  it's  not  clear  that  this  has  a  simple  answer,  nor  how  we  might  proceed. 
We  will  start  with  what  seem  to  be  very  small  steps  toward  the  goal;  surprisingly,  it  turns 
out  that  these  simple  ideas  hold  the  keys  to  a  more  general  understanding. 


Imagine  a  particular  point  on  a  surface;  what  might  we  be  able  to  say  about  how  steep 
it  is?  We  can  limit  the  question  to  make  it  more  familiar:  how  steep  is  the  surface  in  a 
particular  direction?  What  does  this  even  mean?  Here's  one  way  to  think  of  it:  Suppose 
we're  interested  in  the  point  (a,  6,  c) .  Pick  a  straight  line  in  the  x-y  plane  through  the 
point  (a,  6,  0),  then  extend  the  line  vertically  into  a  plane.  Look  at  the  intersection  of  the 
plane  with  the  surface.  If  we  pay  attention  to  just  the  plane,  we  see  the  chosen  straight 
line  where  the  x-axis  would  normally  be,  and  the  intersection  with  the  surface  shows  up  as 
a  curve  in  the  plane.  Figure  14.3.1  shows  the  parabolic  surface  from  figure  14.1.2,  exposing 
its  cross-section  above  the  line  x  4-  y  =  1. 
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In  principle,  this  is  a  problem  we  know  how  to  solve:  find  the  slope  of  a  curve  in  a 
plane.  Let's  start  by  looking  at  some  particularly  easy  lines:  those  parallel  to  the  x  or  y 
axis.  Suppose  we  are  interested  in  the  cross-section  of  /(x,  y)  above  the  line  y  =  b.  If  we 
substitute  b  for  y  in  f{x,y),  we  get  a  function  in  one  variable,  describing  the  height  of  the 
cross-section  as  a  function  of  x.  Because  y  =  b  is  parallel  to  the  x-axis,  if  we  view  it  from 
a  vantage  point  on  the  negative  y-axis,  we  will  see  what  appears  to  be  simply  an  ordinary 
curve  in  the  x-z  plane. 


Figure  14.3.2     f{x,  y)  =      +  y"^ ,  cut  by  the  plane  y  =  2  (AP) 


Consider  again  the  parabolic  surface  f{x,y)  =  x'^  +  y"^.  The  cross-section  above  the 
line  y  =  2  consists  of  all  points  (x,  2,  4-4).  Looking  at  this  cross-section  from  somewhere 
on  the  negative  y  axis,  we  see  what  appears  to  be  just  the  curve  /(x)  =  x^  -(-  4.  At  any 
point  on  the  cross-section,  (a,  2,  +  4),  the  steepness  of  the  surface  in  the  direction  of 
the  line  y  =  2  is  simply  the  slope  of  the  curve  /(x)  =  x^  -(-  4,  namely  2x.  Figure  14.3.2 
shows  the  same  parabolic  surface  as  before,  but  now  cut  by  the  plane  y  =  2.  The  left 
graph  shows  the  cut-off  surface,  the  right  shows  just  the  cross-section,  looking  up  from  the 
negative  y-axis  toward  the  origin. 

If,  say,  we're  interested  in  the  point  (—1,2,5)  on  the  surface,  then  the  slope  in  the 
direction  of  the  line  y  =  2  is  2x  =  2(— 1)  =  —2.  This  means  that  starting  at  (—1,  2,  5)  and 
moving  on  the  surface,  above  the  line  y  —  2,  in  the  direction  of  increasing  x  values,  the 
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surface  goes  down;  of  course  moving  in  the  opposite  direction,  toward  decreasing  x  values, 
the  surface  wiU  rise. 

If  we're  interested  in  some  other  hne  y  =  k,  there  is  reaUy  no  change  in  the  computa- 
tion. The  equation  of  the  cross-section  above  y  =  k  \s  +  k"^  with  derivative  2x.  We  can 
save  ourselves  the  effort,  small  as  it  is,  of  substituting  k  for  y:  all  we  are  in  effect  doing 
is  temporarily  assuming  that  y  is  some  constant.  With  this  assumption,  the  derivative 
^(x^  +  y^)  =  2x.  To  emphasize  that  we  are  only  temporarily  assuming  y  is  constant,  we 
use  a  slightly  different  notation:  +  2/^)  =  2x;  the  "9"  reminds  us  that  there  are  more 

variables  than  x,  but  that  only  x  is  being  treated  as  a  variable.  We  read  the  equation 
as  "the  partial  derivative  of  (x^  -|-  y"^)  with  respect  to  x  is  2x."  A  convenient  alternate 
notation  for  the  partial  derivative  of  /(x,  y)  with  respect  to  x  is  is  fx{x,  y). 

EXAMPLE  14.3.1  The  partial  derivative  with  respect  to  x  of  x^  -|-  3xy  is  3x^  +  3y. 
Note  that  the  partial  derivative  includes  the  variable  y,  unlike  the  example  x^  +  y^.  It  is 
somewhat  unusual  for  the  partial  derivative  to  depend  on  a  single  variable;  this  example 
is  more  typical.  □ 

Of  course,  we  can  do  the  same  sort  of  calculation  for  lines  parallel  to  the  y-axis.  We 
temporarily  hold  x  constant,  which  gives  us  the  equation  of  the  cross-section  above  a  line 
X  =  k.  We  can  then  compute  the  derivative  with  respect  to  y;  this  will  measure  the 
steepness  of  the  curve  in  the  y  direction. 

EXAMPLE  14.3.2    The  partial  derivative  with  respect  to  y  of  /(x,  y)  =  sm{xy)  +  3xy 

So  far,  using  no  new  techniques,  we  have  succeeded  in  measuring  the  slope  of  a  surface 
in  two  quite  special  directions.  For  functions  of  one  variable,  the  derivative  is  closely  linked 
to  the  notion  of  tangent  line.  For  surfaces,  the  analogous  idea  is  the  tangent  plane — a 
plane  that  just  touches  a  surface  at  a  point,  and  has  the  same  "steepness"  as  the  surface 
in  all  directions.  Even  though  we  haven't  yet  figured  out  how  to  compute  the  slope  in 
all  directions,  we  have  enough  information  to  find  tangent  planes.  Suppose  we  want  the 
plane  tangent  to  a  surface  at  a  particular  point  (a,  6,  c).  If  we  compute  the  two  partial 
derivatives  of  the  function  for  that  point,  we  get  enough  information  to  determine  two 
lines  tangent  to  the  surface,  both  through  (a,  b,  c)  and  both  tangent  to  the  surface  in  their 
respective  directions.  These  two  lines  determine  a  plane,  that  is,  there  is  exactly  one  plane 
containing  the  two  lines:  the  tangent  plane.  Figure  14.3.3  shows  (part  of)  two  tangent 
lines  at  a  point,  and  the  tangent  plane  containing  them. 
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Figure  14.3.3     Tangent  vectors  and  tangent  plane. 

How  can  we  discover  an  equation  for  this  tangent  plane?  We  know  a  point  on  the 
plane,  (a,  6,  c);  we  need  a  vector  normal  to  the  plane.  If  we  can  find  two  vectors,  one 
parallel  to  each  of  the  tangent  lines  we  know  how  to  find,  then  the  cross  product  of  these 
vectors  will  give  the  desired  normal  vector. 


T 


0  12  3  4 

Figure  14.3.4     A  tangent  vector. 

How  can  we  find  vectors  parallel  to  the  tangent  lines?  Consider  first  the  line  tangent 
to  the  surface  above  the  line  y  =  b.  A  vector  {u,  v,  w)  parallel  to  this  tangent  line  must 
have  y  component  v  =  0,  and  we  may  as  well  take  the  x  component  to  be  m  =  1.  The  ratio 
of  the  z  component  to  the  x  component  is  the  slope  of  the  tangent  line,  precisely  what  we 
know  how  to  compute.  The  slope  of  the  tangent  line  is  fx{ci,  b),  so 
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In  other  words,  a  vector  parallel  to  this  tangent  line  is  (1,  0, /2;(a,  6)),  as  shown  in  fig- 
ure 14.3.4.  If  we  repeat  the  reasoning  for  the  tangent  line  above  a;  =  a,  we  get  the  vector 
(0,l,/,(a,6)). 

Now  to  find  the  desired  normal  vector  we  compute  the  cross  product,  (0,1,/^)  x 
(1,0, /j;)  =  1).   From  our  earlier  discussion  of  planes,  we  can  write  down  the 

equation  we  seek:  fj:{a,h)x  +  fy{a,b)y  ~  z  =  k,  and  k  as  usual  can  be  computed  by 
substituting  a  known  point:  fx{a,b){a)  +  fy{a,b){b)  —  c  =  k.  There  are  various  more-or- 
less  nice  ways  to  write  the  result: 

/a;  (a,  b)x  +  fy{a,  b)y-  z  =  f^^a,  b)a  +  /^(a,  b)b  -  c 
^(a,  b)x  +  /y(a,  b)y  -  fx{a,  b)a  -  /y(a,  b)b  +  c  =  z 
fx{a,b){x  -a)  +  fy{a,b){y  -  b)  +  c  =  z 
fx{a,  b){x  -  a)  +  fy{a,b){y  -  b)  +  f{a,b)  =  z 


EXAMPLE  14.3.3  Find  the  plane  tangent  to  +  +  ^2  ^4  rpj^jg 
point  is  on  the  upper  hemisphere,  so  we  use  f{x,y)  =  a/4  —  —  y^.  Then  fx{x,y)  — 
-x(4  -  x2  -  y2)-i/2  and  /,(x, y)  =  -y{A  -  x"  -  y^)-^'\  so  /,(1, 1)  =  /^(l,  1)  =  -1/v^ 
and  the  equation  of  the  plane  is 

The  hemisphere  and  this  tangent  plane  are  pictured  in  figure  14.3.3.  □ 

So  it  appears  that  to  find  a  tangent  plane,  we  need  only  find  two  quite  simple  ordinary 
derivatives,  namely  fx  and  fy.  This  is  true  if  the  tangent  plane  exists.  It  is,  unfortunately, 
not  always  the  case  that  if  fx  and  fy  exist  there  is  a  tangent  plane.  Consider  the  function 
xy^  jix?'  +  y^)  pictured  in  figure  14.2.1.  This  function  has  value  0  when  x  =  0  or  y  =  0, 
and  we  can  "plug  the  hole"  by  agreeing  that  /(0,0)  =  0.  Now  it's  clear  that  /a;(0,0)  = 
/j,(0,  0)  =  0,  because  in  the  x  and  y  directions  the  surface  is  simply  a  horizontal  line.  But 
it's  also  clear  from  the  picture  that  this  surface  does  not  have  anything  that  deserves  to 
be  called  a  "tangent  plane"  at  the  origin,  certainly  not  the  x-y  plane  containing  these  two 
tangent  lines. 

When  does  a  surface  have  a  tangent  plane  at  a  particular  point?  What  we  really  want 
from  a  tangent  plane,  as  from  a  tangent  line,  is  that  the  plane  be  a  "good"  approximation 
of  the  surface  near  the  point.  Here  is  how  we  can  make  this  precise: 
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DEFINITION  14.3.4    Let  Ax  =  x  -  xq,  Ay  =  y  -  yo,  and  Az  =  z  -  zq  where 
=  fi^o,  yo)-  The  function  z  =  /(x,  y)  is  differentiable  at  (xq,  yo)  if 

Az  =  fj:(xo,yo)Ax  +  fy{xQ,yo)Ay  +  eiAx  +  e2Ay, 

and  both  ei  and  £2  approach  0  as  (x,  y)  approaches  (xo,yo)-  □ 
This  definition  takes  a  bit  of  absorbing.  Let's  rewrite  the  central  equation  a  bit: 

z  =  fx{xo,yo)ix  -  xo)  +  fyixo,yo)iy  -  yo)  +  fi^o^Vo)  +  eiAx  +  e2Ay.  (14.3.1) 
The  first  three  terms  on  the  right  are  the  equation  of  the  tangent  plane,  that  is, 

fx{xo:  yo){x  -  Xo)  +  fy{xo,  yo){y  -  yo)  +  /(a;o,  Vo) 

is  the  2;- value  of  the  point  on  the  plane  above  (x,  y).  Equation  14.3.1  says  that  the  2- value 
of  a  point  on  the  surface  is  equal  to  the  2;-value  of  a  point  on  the  plane  plus  a  "little  bit," 
namely  eiAx  +  e2Ay.  As  {x,y)  approaches  {xo,yo)^  both  Ax  and  Ay  approach  0,  so  this 
little  bit  eiAx  +  £2 Ay  also  approaches  0,  and  the  2- values  on  the  surface  and  the  plane 
get  close  to  each  other.  But  that  by  itself  is  not  very  interesting:  since  the  surface  and 
the  plane  both  contain  the  point  (xq,  yo^zo),  the  z  values  will  approach  zo  and  hence  get 
close  to  each  other  whether  the  tangent  plane  is  "tangent"  to  the  surface  or  not.  The  extra 
condition  in  the  definition  says  that  as  (x,y)  approaches  {xo^yo),  the  e  values  approach 
0 — this  means  that  eiAx  +  e2Ay  approaches  0  much,  much  faster,  because  eiAx  is  much 
smaller  than  either  ei  or  Ax.  It  is  this  extra  condition  that  makes  the  plane  a  tangent 
plane. 

We  can  see  that  the  extra  condition  on  ei  and  £2  is  just  what  is  needed  if  we  look  at 
partial  derivatives.  Suppose  we  temporarily  fix  y  =  yo,  so  Ay  =  0.  Then  the  equation 
from  the  definition  becomes 


Az  =  fx{xo,  yo)Ax  +  eiAx 


or 

^  =  fx{xo,yo)  +  ei- 

Now  taking  the  limit  of  the  two  sides  as  Ax  approaches  0,  the  left  side  turns  into  the 
partial  derivative  of  z  with  respect  to  x  at  (xci/o)?  or  in  other  words  fx{xo,yo),  and  the 
right  side  does  the  same,  because  as  (x,  y)  approaches  (xq,  yo),  ci  approaches  0.  Essentially 
the  same  calculation  works  for  fy. 
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Exercises  14-3. 

1.  Find  fx  and  fy  where  f{x,y) 

2.  Find  fx  and  fy  where  f{x,y) 

3.  Find  fx  and  fy  where  f{x,y) 

4.  Find  fx  and  fy  where  f{x,y) 

5.  Find  fx  and  fy  where  f{x,y) 

6.  Find  fx  and  fy  where  f{x,y) 

7.  Find  fx  and  /j;  where  f{x,y) 

8.  Find  an  equation  for  the  plane  tangent  to  2x'^  +  3y^  —      =  4  at  (1, 1,  —1).  =r- 

9.  Find  an  equation  for  the  plane  tangent  to  f{x,y)  =  sin(2;y)  at  (tt,  1/2, 1).  =^> 

10.  Find  an  equation  for  the  plane  tangent  to  f(x,y)  =  x^  +  y^  at  (3, 1, 10).  =^ 

11.  Find  an  equation  for  the  plane  tangent  to  f{x,y)  =  xln{xy)  at  (2, 1/2,0).  =^> 

12.  Find  an  equation  for  the  line  normal  to  x'^  +       =  2z  at  (2, 1,  4).  =^> 

13.  Explain  in  your  own  words  why,  when  taking  a  partial  derivative  of  a  function  of  multiple 
variables,  we  can  treat  the  variables  not  being  differentiated  as  constants. 

14.  Consider  a  differentiable  function,  f{x,y).  Give  physical  interpretations  of  the  meanings  of 
fx{a,  b)  and  fy{a,  b)  as  they  relate  to  the  graph  of  /. 

15.  In  much  the  same  way  that  we  used  the  tangent  line  to  approximate  the  value  of  a  function 
from  single  variable  calculus,  we  can  use  the  tangent  plane  to  approximate  a  function  from 
multivariable  calculus.  Consider  the  tangent  plane  found  in  Exercise  11.  Use  this  plane  to 
approximate  /(1. 98,  0.4). 

16.  Suppose  that  one  of  your  colleagues  has  calculated  the  partial  derivatives  of  a  given  function, 
and  reported  to  you  that  fx{x,y)  =  2x  +  3y  and  that  fy{x,y)  =  4x  +  6y.  Do  you  believe 
them?  Why  or  why  not?  If  not,  what  answer  might  you  have  accepted  for  fy? 

17.  Suppose  f{t)  and  g{t)  are  single  variable  differentiable  functions.  Find  dz/dx  and  dz/dy  for 
each  of  the  following  two  variable  functions. 

a.  z  =  f{x)g{y) 

h.  z  =  f{xy) 

c.  z  =  f{x/y) 

14.4    The  Chain  Rule 

Consider  the  surface  z  =  x^y  +  xy^,  and  suppose  that  X  =  2  +  and  y  =  1  -  t^.  We  can 
think  of  the  latter  two  equations  as  describing  how  x  and  y  change  relative  to,  say,  time. 
Then 

z  =  x^y  +        =  (2  +  t^f{l  -  t^)  +  (2  +  t^){l  -  t^f 

tells  us  explicitly  how  the  z  coordinate  of  the  corresponding  point  on  the  surface  depends 
on  t.  If  we  want  to  know  dz/dt  we  can  compute  it  more  or  less  directly — it's  actually  a  bit 


=  cos(2;^y)  +  y^ . 
^     xy  ^ 
x^  +  y' 

=  e      "  .  =^ 
=  xy  ln(a;y).  =J> 
=  ^/l  —  x'^  —  y^. 
=  xtan(y).  =^> 
1 

=  — .  =^ 
xy 
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simpler  to  use  the  chain  rule: 

—  =  x^y  +  2xx'y  +  x2yy'  +  x'y'^ 
at 

=  {2xy  +  y'^)x'  +  (x^  +  '2.xy)y' 

=  (2(2  +  t%l  -  t^)  +  (1  -  tY){4t^)  +  ((2  +  t^f  +  2(2  +  t^){l  -  t^))(-3t2) 

If  we  look  carefully  at  the  middle  step,  dz/dt  =  {2xy  +  y'^)x'  +  (x^  +  2xy)y' ,  we  notice  that 
2xy  +  y'^  is  dz/dx,  and  +  2xy  is  dz/dy.  This  turns  out  to  be  true  in  general,  and  gives 
lis  a  new  chain  rule: 

THEOREM  14.4.1  Suppose  that  z  =  /(x,  y),  /  is  differentiable,  x  =  g{t),  and  y  =  h{t). 
Assuming  that  the  relevant  derivatives  exist, 

dz  dz  dx  dz  dy 
dt      dx  dt      dy  dt 

Proof.     If  /  is  differentiable,  then 

Az  =  f^{xo,yo)Ax  +  fy{xo,yo)Ay  +  eiAx  +  e2Ay, 
where  ei  and  62  approach  0  y)  approaches  (xo,yo)-  Then 


Az      ^  Ax      ^  Ay        Ax        Ay  ,  ,  ^  , 

^  =  (14.4.1) 


As  At  approaches  0,  (x,y)  approaches  (xo,yo)  and  so 

, .     Az  dz 

lim  — —  =  — 
At^-o  At  dt 

Ax  dx 
lim  ei  — —  =  0  ■  — 
At^-o    At  dt 

y        Ay  dy 
lim  €2  -r-  =  0  ■  — 
At^o    At  dt 

and  so  taking  the  limit  of  (14.4.1)  as  At  goes  to  0  gives 


dz  dx  dy 
rft  "^"rft  "^^^rft' 


as  desired. 
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We  can  write  the  chain  rule  in  way  that  is  somewhat  closer  to  the  single  variable  chain 
rule: 

^  =  {fxJy)  ■  {x',y'), 

or  (roughly)  the  derivatives  of  the  outside  function  "times"  the  derivatives  of  the  inside 
functions.  Not  surprisingly,  essentially  the  same  chain  rule  works  for  functions  of  more 
than  two  variables,  for  example,  given  a  function  of  three  variables  f{x,y,  z),  where  each 
oi  X,  y  and  z  is  a  function  of  t, 

^  =  ifxjyjz)  ■  {x',y',z'). 

Wc  can  even  extend  the  idea  further.  Suppose  that  /(x,  y)  is  a  function  and  x  =  g{s,  t) 
and  y  —  h{s,t)  are  functions  of  two  variables  s  and  t.  Then  /  is  "really"  a  function  of  s 
and  t  as  well,  and 

-  f       ^  f  u  -  f      ^  f  u 

—  Jx9s  +  Jyi^s  —  Jx9t  +  Jyi^t- 

The  natural  extension  of  this  to  f{x,y,z)  works  as  well. 

Recall  that  we  used  the  ordinary  chain  rule  to  do  implicit  differentiation.  We  can  do 
the  same  with  the  new  chain  rule. 

EXAMPLE  14.4.2  x"^  +y'^  +  z'^  =4  defines  a  sphere,  which  is  not  a  function  of  x  and 
y,  though  it  can  be  thought  of  as  two  functions,  the  top  and  bottom  hemispheres.  We 
can  think  of  z  as  one  of  these  two  functions,  so  really  z  =  z{x,y),  and  we  can  think  of  x 
and  y  as  particularly  simple  functions  of  x  and  y,  and  let  f{x,  y,  z)  =  x'^  +  y'^  +  z'^.  Since 
f{x,  y,  z)  —  4,  df/dx  —  0,  but  using  the  chain  rule: 


^  _  f  ^  ,  f  dy  dz^ 
dx'^'^dx^^^dx^^'dx 

=  {2x){l)  +  {2ym  +  {2z)^., 


noting  that  since  y  is  temporarily  held  constant  its  derivative  dy/dx  =  0.  Now  we  can 
solve  for  dz/dx: 

dz  2x  X 
dx        2z  z 

In  a  similar  manner  we  can  compute  dz/dy.  □ 
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Exercises  14-4- 

1.  Use  the  chain  rule  to  compute  dz/dt  for  z  =  sm{x^  +  y^),  x  =  t'^  +  3,  y  =  . 

2.  Use  the  chain  rule  to  compute  dz/dt  for  z  =  x^y,  x  =  sin(t),  y  =     +  1.  =^ 

3.  Use  the  chain  rule  to  compute  dz/ds  and  dz/dt  for  z  =  x^y,  x  =  sin(st),  y  =  t'^  +  s^.  =^ 

4.  Use  the  chain  rule  to  compute  dz/ds  and  dz/dt  for  z  =  x'^y'^,  x  =  st,  y  =  t^  —  . 

5.  Use  the  chain  rule  to  compute  dz/dx  and  dz/dy  for  2x^  +  3y^  —  1z^  =  9.  =^ 

6.  Use  the  chain  rule  to  compute  dz/dx  and  dz/dy  for  2x'^  +  y'^  +  z'^  =  9. 

7.  Chemistry  students  will  recognize  the  ideal  gas  law,  given  by  PV  =  nRT  which  relates  the 
Pressure,  Volume,  and  Temperature  of  n  moles  of  gas.  (R  is  the  ideal  gas  constant).  Thus, 
we  can  view  pressure,  volume,  and  temperature  as  variables,  each  one  dependent  on  the  other 
two. 

a.  If  pressure  of  a  gas  is  increasing  at  a  rate  of  0.2Pa/min  and  temperature  is  increasing  at 
a  rate  of  liT/min,  how  fast  is  the  volume  changing? 

b.  If  the  volume  of  a  gas  is  decreasing  at  a  rate  of  0.3L/min  and  temperature  is  increasing 
at  a  rate  of  .5K/mm,  how  fast  is  the  pressure  changing? 

c.  If  the  pressure  of  a  gas  is  decreasing  at  a  rate  of  0.4Pa/min  and  the  volume  is  increasing 
at  a  rate  of  SL/min,  how  fast  is  the  temperature  changing? 

8.  Verify  the  following  identity  in  the  case  of  the  ideal  gas  law: 

dVdfdP  ~ 

9.  The  previous  exercise  was  a  special  case  of  the  following  fact,  which  you  are  to  verify  here: 
If  F{x,y,z)  is  a  function  of  3  variables,  and  the  relation  F{x,y,z)  =  0  defines  each  of  the 
variables  in  terms  of  the  other  two,  namely  x  =  f{y,  z),  y  =  g{x,  z)  and  z  =  h{x,  y),  then 

dx  dy  dz  _  ^ 
dy  dz  dx 

14.5    Directional  Derivatives 

We  still  have  not  answered  one  of  our  first  questions  about  the  steepness  of  a  surface: 
starting  at  a  point  on  a  surface  given  by  /(x,  y),  and  walking  in  a  particular  direction,  how 
steep  is  the  surface?  We  are  now  ready  to  answer  the  question. 

We  already  know  roughly  what  has  to  be  done:  as  shown  in  figure  14.3.1,  we  extend  a 
line  in  the  x-y  plane  to  a  vertical  plane,  and  we  then  compute  the  slope  of  the  curve  that 
is  the  cross-section  of  the  surface  in  that  plane.  The  major  stumbling  block  is  that  what 
appears  in  this  plane  to  be  the  horizontal  axis,  namely  the  line  in  the  x-y  plane,  is  not  an 
actual  axis — we  know  nothing  about  the  "units"  along  the  axis.  Our  goal  is  to  make  this 
line  into  a  t  axis;  then  we  need  formulas  to  write  x  and  y  in  terms  of  this  new  variable  t; 
then  we  can  write  z  in  terms  of  t  since  we  know  z  in  terms  of  x  and  y;  and  finally  we  can 
simply  take  the  derivative. 
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So  we  need  to  somehow  "mark  off"  units  on  the  hne,  and  we  need  a  convenient  way 
to  refer  to  the  hne  in  calculations.  It  turns  out  that  we  can  accomplish  both  by  using  the 
vector  form  of  a  line.  Suppose  that  u  is  a  unit  vector  (tti,  M2)  in  the  direction  of  interest.  A 
vector  equation  for  the  line  through  [xq,  yo)  in  this  direction  is  v(t)  =  {uit  +  xq,  U2t  +  i/q). 
The  height  of  the  surface  above  the  point  {uit  +  xo,U2t  +  yo)  is  g{t)  —  f{uit  +  XQ,U2t  +  yo). 
Because  u  is  a  unit  vector,  the  value  of  t  is  precisely  the  distance  along  the  line  from 
{xo,yo)  to  {uit  +  XQ,U2t  +  yo);  this  means  that  the  line  is  effectively  a  t  axis,  with  origin 
at  the  point  (xq,  j/o),  so  the  slope  we  seek  is 

9'{Q)  =  {fx{xo,yo)Jy{xo,yo))  ■  {ui,U2) 
=  V/-U 

Here  we  have  used  the  chain  rule  and  the  derivatives  -^{uit+xo)  =  ui  and  ■^{u2t+yo)  =  U2. 
The  vector  {fx,fy)  is  very  useful,  so  it  has  its  own  symbol,  V/,  pronounced  "del  f";  it  is 
also  called  the  gradient  of  /. 

EXAMPLE  14.5.1  Find  the  slope  of  z  =  x"^  +  y^  at  (1, 2)  in  the  direction  of  the  vector 
(3,4). 

We  first  compute  the  gradient  at  (1,  2):  V/  =  {2x,  2y),  which  is  (2, 4)  at  (1,  2).  A  unit 
vector  in  the  desired  direction  is  (3/5, 4/5),  and  the  desired  slope  is  then  (2, 4)  •  (3/5, 4/5)  = 
6/5  +  16/5  =  22/5.  □ 

EXAMPLE  14.5.2  Find  a  tangent  vector  to  z  =  x'^  +  y'^  at  (1,2)  in  the  direction  of 
the  vector  (3, 4)  and  show  that  it  is  parallel  to  the  tangent  plane  at  that  point. 

Since  (3/5,4/5)  is  a  unit  vector  in  the  desired  direction,  we  can  easily  expand  it  to  a 
tangent  vector  simply  by  adding  the  third  coordinate:  (3/5,4/5,22/5).  To  see  that  this 
vector  is  parallel  to  the  tangent  plane,  we  can  compute  its  dot  product  with  a  normal  to 
the  plane.  We  know  that  a  normal  to  the  tangent  plane  is 

(/,(l,2),/j,(l,2),-l)  =  (2,4,-1), 

and  the  dot  product  is  (2,4,-1)  ■  (3/5,4/5,22/5)  =  6/5  +  16/5  -  22/5  =  0,  so  the  two 
vectors  are  perpendicular.  (Note  that  the  vector  normal  to  the  surface,  namely  {fx,  fy,  — 1), 
is  simply  the  gradient  with  a  —1  tacked  on  as  the  third  component.)  □ 

The  slope  of  a  surface  given  by  ^  =  f{x,  y)  in  the  direction  of  a  (two-dimensional) 
vector  u  is  called  the  directional  derivative  of  /,  written  -Du/-  The  directional  derivative 
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immediately  provides  us  with  some  additional  information.  We  know  that 

i^u/  =  V/-u  =  |V/||u|cos^=  |V/|cos^ 

if  u  is  a  unit  vector;  9  is  the  angle  between  V/  and  u.  This  tells  us  immediately  that  the 
largest  value  of  -Du/  occurs  when  cos  6*  =  1,  namely,  when  6*  =  0,  so  V/  is  parallel  to  u. 
In  other  words,  the  gradient  V/  points  in  the  direction  of  steepest  ascent  of  the  surface, 
and  |V/|  is  the  slope  in  that  direction.  Likewise,  the  smallest  value  of  Du/  occurs  when 
cos^  =  —1,  namely,  when  ^  =  tt,  so  V/  is  anti-parallel  to  u.  In  other  words,  —V/  points 
in  the  direction  of  steepest  descent  of  the  surface,  and  —  |V/|  is  the  slope  in  that  direction. 

EXAMPLE  14.5.3    Investigate  the  direction  of  steepest  ascent  and  descent  for  z  = 

The  gradient  is  {2Xj2y)  —  2{x,y);  this  is  a  vector  parallel  to  the  vector  so  the 

direction  of  steepest  ascent  is  directly  away  from  the  origin,  starting  at  the  point  {x,y). 
The  direction  of  steepest  descent  is  thus  directly  toward  the  origin  from  Note  that 

at  (0,  0)  the  gradient  vector  is  (0,  0),  which  has  no  direction,  and  it  is  clear  from  the  plot 
of  this  surface  that  there  is  a  minimum  point  at  the  origin,  and  tangent  vectors  in  all 
directions  are  parallel  to  the  x-y  plane.  □ 

If  V/  is  perpendicular  to  u,  D^f  =  |V/|  cos(7r/2)  =  0,  since  cos(7r/2)  =  0.  This  means 
that  in  either  of  the  two  directions  perpendicular  to  V/,  the  slope  of  the  surface  is  0;  this 
implies  that  a  vector  in  cither  of  these  directions  is  tangent  to  the  level  curve  at  that  point. 
Starting  with  V/  =  {fx,  fy),  it  is  easy  to  find  a  vector  perpendicular  to  it:  either  {fy,  —  fx) 
or  {-fy,fx)  will  work. 

If  /(a;,  z)  is  a  function  of  three  variables,  all  the  calculations  proceed  in  essentially 
the  same  way.  The  rate  at  which  /  changes  in  a  particular  direction  is  V/  ■  u,  where  now 
V/  =  {fx,  fy,  fz)  and  u  =  {ui,U2,uz)  is  a  unit  vector.  Again  V/  points  in  the  direction  of 
maximum  rate  of  increase,  —V/  points  in  the  direction  of  maximum  rate  of  decrease,  and 
any  vector  perpendicular  to  V/  is  tangent  to  the  level  surface  f{x,y,z)  =  A;  at  the  point 
in  question.  Of  course  there  are  no  longer  just  two  such  vectors;  the  vectors  perpendicular 
to  V/  describe  the  tangent  plane  to  the  level  surface,  or  in  other  words  V/  is  a  normal  to 
the  tangent  plane. 

EXAMPLE  14.5.4  Suppose  the  temperature  at  a  point  in  space  is  given  by  T(x,  y,  z)  = 
To/(l  +x^  +  y'^  +  -2^);  at  the  origin  the  temperature  in  Kelvin  is  Tq  >  0,  and  it  decreases  in 
every  direction  from  there.  It  might  be,  for  example,  that  there  is  a  source  of  heat  at  the 
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origin,  and  as  we  get  farther  from  the  source,  the  temperature  decreases.  The  gradient  is 
^  /  -2Tox  ^  -2Tox  ^  -2Tox 

(1  +  x2  +  y2  _^  ^2^        (1  _^  a;2  _^^2  ^  ^2)2        (1  _|_  3.2  _^  ^2  _f.  ^2)2  / 

-2To 


(1  +  x2  +  y2  +  ^2)2 

The  gradient  points  directly  at  the  origin  from  the  point  (x,  y,  z) — by  moving  directly 
toward  the  heat  source,  we  increase  the  temperature  as  quickly  as  possible.  □ 

EXAMPLE  14.5.5  Find  the  points  on  the  surface  defined  by  +  2y'^  +  =  1  where 
the  tangent  plane  is  parallel  to  the  plane  defined  by  3x  —  y  +  3^  =  1. 

Two  planes  are  parallel  if  their  normals  are  parallel  or  anti-parallel,  so  we  want  to 
find  the  points  on  the  surface  with  normal  parallel  or  anti-parallel  to  (3,-1,3).  Let  /  = 
x^  -|-  2y^  -\-  3z^;  the  gradient  of  /  is  normal  to  the  level  surface  at  every  point,  so  we  are 
looking  for  a  gradient  parallel  or  anti-parallel  to  (3,  —1,  3).  The  gradient  is  (2x,  4y,  6z);  if 
it  is  parallel  or  anti-parallel  to  (3,  —1,  3),  then 

{2x,4y,6z)  =  /c(3,  -1,3) 

for  some  k.  This  means  we  need  a  solution  to  the  equations 

2x  —  3k       4y  =  —k       6z  —  3k 

but  this  is  three  equations  in  four  unknowns — we  need  another  equation.  What  we  haven't 
used  so  far  is  that  the  points  we  seek  are  on  the  surface  x^  -f-  +  3z^  =  1;  this  is  the 
fourth  equation.  If  we  solve  the  first  three  equations  for  x,  y,  and  z  and  substitute  into 
the  fourth  equation  we  get 

-k\^  f3k^'^ 


2 

+2 

2 

3 


4 

k 


+  3 


t.      .2^2   ^,     ,    .    ,     .  ^  3^2     72  72  \  3^2  V2  V2\ 

so  k  =  ±  .  1  he  desned  pomts  are   ,  ,  —    and  ,  — ,  .  You 

5  1510     5/         1510  5/ 

can  see  the  surface  and  all  three  planes  in  the  Java  applet.  □ 
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Exercises  14-5. 

1.  Find  Duf  for  f  =      +  xy  +      in  the  direction  of  u  =  (2, 1)  at  the  point  (1, 1).  =^> 

2.  Find  D^f  for  /  =  sin(xy)  in  the  direction  of  u  =  (  —  1, 1)  at  the  point  (3, 1).  =^ 

3.  Find  D^f  for  f  =  cos(y)  in  the  direction  30  degrees  from  the  positive  x  axis  at  the  point 
(l,7r/4).  ^ 

4.  The  temperature  of  a  thin  plate  in  the  x-y  plane  is  T  =  x'^  +  y'^ .  How  fast  does  temperature 
change  at  the  point  (1,5)  moving  in  a  direction  30  degrees  from  the  positive  x  axis?  =^ 

5.  Suppose  the  density  of  a  thin  plate  at  (x,y)  is  1/ ^/x^  +  y'^  +  1.  Find  the  rate  of  change  of 
the  density  at  (2, 1)  in  a  direction  n/3  radians  from  the  positive  x  axis.  =^ 

6.  Suppose  the  electric  potential  at  {x,  y)  is  In  Y^x^~+~y^.  Find  the  rate  of  change  of  the  potential 
at  (3, 4)  toward  the  origin  and  also  in  a  direction  at  a  right  angle  to  the  direction  toward  the 
origin. 

7.  A  plane  perpendicular  to  the  x-y  plane  contains  the  point  (2,1,8)  on  the  paraboloid  z  = 
X  +  4y^  The  cross-section  of  the  paraboloid  created  by  this  plane  has  slope  0  at  this  point. 
Find  an  equation  of  the  plane. 

8.  A  plane  perpendicular  to  the  x-y  plane  contains  the  point  (3,  2,  2)  on  the  paraboloid  362;  = 
4x'^  +  9y^.  The  cross-section  of  the  paraboloid  created  by  this  plane  has  slope  0  at  this  point. 
Find  an  equation  of  the  plane.  =^ 

9.  Suppose  the  temperature  at  (x,  y,  z)  is  given  by  T  =  xy  -|-  sin(yz).  In  what  direction  should 
you  go  from  the  point  (1, 1, 1)  to  decrease  the  temperature  as  quickly  as  possible?  What  is 
the  rate  of  change  of  temperature  in  this  direction? 

10.  Suppose  the  temperature  at  (x,  y,  z)  is  given  by  T  =  xyz.  In  what  direction  can  you  go  from 
the  point  (1,1,1)  to  maintain  the  same  temperature?  =r- 

11.  Find  an  equation  for  the  plane  tangent  to  x^  —  3y^  -|-     =  7  at  (1, 1,  3).  =^ 

12.  Find  an  equation  for  the  plane  tangent  to  xyz  =  6  at  (1,  2,  3).  =r- 

13.  Find  an  equation  for  the  line  normal  to  x^  -\-  2y^  +  4z^  =  26  at  (2,  —3,  —1). 

14.  Find  an  equation  for  the  line  normal  to  x"^  -\-  y"^  +  9z^  =  56  at  (4,  2,  —2).  =^> 

15.  Find  an  equation  for  the  line  normal  to  x'^  -\-  5y^  —  ^'^  =  0  at  (4,  2,  6). 

16.  Find  the  directions  in  which  the  directional  derivative  of  /(x,  y)  =  -|-  sin(xy)  at  the  point 
(1,0)  has  the  value  1.  =^> 

17.  Show  that  the  curve  r(t)  =  {ln{t),tln(t),t)  is  tangent  to  the  surface  xz'^  —  yz  +  cos(xy)  =  1 
at  the  point  (0,  0, 1). 

18.  A  bug  is  crawling  on  the  surface  of  a  hot  plate,  the  temperature  of  which  at  the  point  x 
units  to  the  right  of  the  lower  left  corner  and  y  units  up  from  the  lower  left  corner  is  given 
by  r(x,y)  =  100  -  x^  -  3y^ 

a.  If  the  bug  is  at  the  point  (2, 1),  in  what  direction  should  it  move  to  cool  off  the  fastest? 
How  fast  will  the  temperature  drop  in  this  direction? 

b.  If  the  bug  is  at  the  point  (1,3),  in  what  direction  should  it  move  in  order  to  maintain  its 
temperature? 

19.  The  elevation  on  a  portion  of  a  hill  is  given  by  /(x,  y)  =  100  —  4x'^  —  2y.  From  the  location 
above  (2, 1),  in  which  direction  will  water  run?  =^ 
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20.  Suppose  that  g{x,  y)  =  y  —  x^.  Find  the  gradient  at  the  point  (—1,  3).  Sketch  the  level  curve 
to  the  graph  of  g  when  g{x,y)  =  2,  and  plot  both  the  tangent  line  and  the  gradient  vector 
at  the  point  (—1,3).  (Make  your  sketch  large).  What  do  you  notice,  geometrically?  =^ 

21.  The  gradient  V/  is  a  vector  valued  function  of  two  variables.  Prove  the  following  gradient 
rules.  Assume  f{x,y)  and  g{x,y)  are  differentiable  functions. 

a.  V(/5)  =  /V(5)+ffV(/) 

b.  V(//9)  =  {gVf  -  fVg)/g^ 

c.  V((/(x,y))")  =  n/(x,y)"-^V/ 

14.6    Higher  order  derivatives 

In  single  variable  calculus  we  saw  that  the  second  derivative  is  often  useful:  in  appropriate 
circumstances  it  measures  acceleration;  it  can  be  used  to  identify  maximum  and  minimum 
points;  it  tells  us  something  about  how  sharply  curved  a  graph  is.  Not  surprisingly,  second 
derivatives  are  also  useful  in  the  multi- variable  case,  but  again  not  surprisingly,  things  are 
a  bit  more  complicated. 

It's  easy  to  see  where  some  complication  is  going  to  come  from:  with  two  variables 
there  are  four  possible  second  derivatives.  To  take  a  "derivative,"  we  must  take  a  partial 
derivative  with  respect  to  x  or  y,  and  there  are  four  ways  to  do  it:  x  then  x,  x  then  y,  y 
then  X,  y  then  y. 

EXAMPLE  14.6.1     Compute  all  four  second  derivatives  of  f{x,y)  =  x^y^. 
Using  an  obvious  notation,  we  get: 

fxx     2?/        fxy     Axy       fyx     4:xy       fyy     2x  . 

□ 

You  will  have  noticed  that  two  of  these  are  the  same,  the  "mixed  partials"  computed 
by  taking  partial  derivatives  with  respect  to  both  variables  in  the  two  possible  orders.  This 
is  not  an  accident — as  long  as  the  function  is  reasonably  nice,  this  will  always  be  true. 

theorem  14.6.2  Clairaut's  Theorem  If  the  mixed  partial  derivatives  are  con- 
tinuous, they  are  equal.  ■ 

EXAMPLE  14.6.3     Compute  the  mixed  partials  of  /  =  xy/(x^  +  y^). 

y^  —  x'^y  x^  —  6x^y^  +  y^ 

■I-  =  (x2+y2)2  -l^y  =  (x2+y2)3 

We  leave  fyj;  as  an  exercise.  □ 
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Exercises  14-6. 

1.  Let  /  =  xy/{x^  +  V^)]  compute  fxx,  fyx,  and  fyy.  =^ 

2.  Find  all  first  and  second  partial  derivatives  of  x^y'^  +     .  =^ 

3.  Find  all  first  and  second  partial  derivatives  of  Ax^  +  xy^  +  10. 

4.  Find  all  first  and  second  partial  derivatives  of  x  sin  y.  =^ 

5.  Find  all  first  and  second  partial  derivatives  of  sin(3x)  cos(2y).  =^ 

I  2 

6.  Find  all  first  and  second  partial  derivatives  of  . 

7.  Find  all  first  and  second  partial  derivatives  of  In  ^x"^  +  y^. 

8.  Find  all  first  and  second  partial  derivatives  of  z  with  respect  to  x  and  y  if  x^+4y'^  +  162;'^ — 64  = 
0. 

9.  Find  all  first  and  second  partial  derivatives  of  z  with  respect  to  x  and  y  if  xy  +  yz  +  xz  =  1. 

=^ 

10.  Let  a  and  k  be  constants.  Prove  that  the  function  u{x,  t)  =  e-"     '  sm{kx)  is  a  solution  to 
the  heat  equation  ut  =  a'^Uxx 

11.  Let  a  be  a  constant.  Prove  that  u  =  sm(x  —  at) +ln(x  +  at)  is  a  solution  to  the  wave  equation 
utt  =  a^Uxx- 

12.  How  many  third-order  derivatives  does  a  function  of  2  variables  have?  How  many  of  these 
are  distinct? 

13.  How  many  nth  order  derivatives  does  a  function  of  2  variables  have?  How  many  of  these  are 
distinct? 

14.7    Maxima  and  minima 

Suppose  a  surface  given  by  /(x,  y)  has  a  local  maximum  at  (xq,  yo-,  zq)]  geometrically,  this 
point  on  the  surface  looks  like  the  top  of  a  hill.  If  we  look  at  the  cross-section  in  the 
plane  y  =  yo,  we  v^^ill  see  a  local  maximum  on  the  curve  at  (xq^zq),  and  we  know  from 
single-variable  calculus  that  =  0  at  this  point.  Likewise,  in  the  plane  x  =  xq,  f|  =  0. 
So  if  there  is  a  local  maximum  at  (xo,yo 5^0)5  both  partial  derivatives  at  the  point  must 
be  zero,  and  likewise  for  a  local  minimum.  Thus,  to  find  local  maximum  and  minimum 
points,  we  need  only  consider  those  points  at  which  both  partial  derivatives  are  0.  As  in 
the  single-variable  case,  it  is  possible  for  the  derivatives  to  be  0  at  a  point  that  is  neither 
a  maximum  or  a  minimum,  so  we  need  to  test  these  points  further. 

You  will  recall  that  in  the  single  variable  case,  we  examined  three  methods  to  identify 
maximum  and  minimum  points;  the  most  useful  is  the  second  derivative  test,  though  it 
does  not  always  work.  For  functions  of  two  variables  there  is  also  a  second  derivative  test; 
again  it  is  by  far  the  most  useful  test,  though  it  doesn't  always  work. 

THEOREM  14.7.1  Suppose  that  the  second  partial  derivatives  of  f{x,y)  are  contin- 
uous near  {xo,yo),  and  fx{xo,yo)  =  fy{xo,yo)  =  0.  We  denote  by  D  the  discriminant 
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Dixo.yo)  =  fxx{xo,yo)fyy{xo,yo)  -  fxyixo,yo)'^.  If  D  >  0  and  fxx{xo,yo)  <  0  there  is 
a  local  maximum  at  (xo.yo);  if  D  >  0  and  fxx{xo,yo)  >  0  there  is  a  local  minimum  at 
(xo^yo)]  if  D  <  0  there  is  neither  a  maximum  nor  a  minimum  at  (xo^yo);  if  -D  =  0,  the 
test  fails.  ■ 

EXAMPLE  14.7.2    Verify  that  f{x,  y)  =  x'^  +  y'^  has  a  minimum  at  (0,  0). 
First,  we  compute  all  the  needed  derivatives: 

fx   ~  2X  fy   =  2y  fxx   —  2  fyy   =  2  fxy   =  0. 

The  derivatives  fx  and  fy  are  zero  only  at  (0,  0).  Applying  the  second  derivative  test  there: 

1^(0,  0)  =  /,,(0,  Q)fyy{Q.  0)  -  /,j,(0,  0)2  =  2  •  2  -  0  =  4  >  0, 

so  there  is  a  local  minimum  at  (0,  0),  and  there  are  no  other  possibilities.  □ 

EXAMPLE  14.7.3     Find  all  local  maxima  and  minima  for  /(x,  y)  =  x"^  —  y"^. 
The  derivatives: 

fx   ~  2X  fy   =        2y  fxx   —  2  fyy   =        2  fxy   =  0. 

Again  there  is  a  single  critical  point,  at  (0,0),  and 

1^(0,  0)  =  fxxiO,  0)fyy{0,  0)  -  fxy{0,  0)2  =  2  ■  -2  -  0  =  -4  <  0, 

so  there  is  neither  a  maximum  nor  minimum  there,  and  so  there  are  no  local  maxima  or 
minima.  The  surface  is  shown  in  figure  14.7.1.  □ 

EXAMPLE  14.7.4    Find  all  local  maxima  and  minima  for  /(x,  y)  =      +  y^. 
The  derivatives: 

fx  ~  4x        fy  =  Ay        fxx  —  12x        fyy  =  12y        fxy  =  0. 

Again  there  is  a  single  critical  point,  at  (0,  0),  and 

D{0,  0)  =  fxxiO,  0)fyy{0,  0)  -  fxy{0,  0)2  =  0  ■  0  -  0  =  0, 

so  we  get  no  information.  However,  in  this  case  it  is  easy  to  see  that  there  is  a  minimum 
at  (0,0),  because  /(0,0)  =  0  and  at  all  other  points  f{x,y)  >  0.  □ 

EXAMPLE  14.7.5     Find  all  local  maxima  and  minima  for  /(x,  y)  =  x^  +  y^. 
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Figure  14.7.1     A  saddle  point,  neither  a  maximum  nor  a  minimum.  (AP) 
The  derivatives: 

fx  —  3x        fy  =  3y        fxx  —  6x        fyy  =  Gy        fxy  =  0. 

Again  there  is  a  single  critical  point,  at  (0,  0),  and 

1^(0,  0)  =  /,,(0,  0)/y,(0,  0)  -  /,y(0,  0)2  =  0  ■  0  -  0  =  0, 

so  we  get  no  information.  In  this  case,  a  little  thought  shows  there  is  neither  a  maximum 
nor  a  minimum  at  (0,0):  when  x  and  y  are  both  positive,  f{x,y)  >  0,  and  when  x  and 
y  are  both  negative,  f{x,y)  <  0,  and  there  are  points  of  both  kinds  arbitrarily  close  to 
(0,0).  Alternately,  if  we  look  at  the  cross-section  when  y  =  0,  we  get  f{x,0)  =  x^,  which 
does  not  have  either  a  maximum  or  minimum  at  x  =  0.  □ 


EXAMPLE  14.7.6  Suppose  a  box  with  no  top  is  to  hold  a  certain  volume  V.  Find  the 
dimensions  for  the  box  that  result  in  the  minimum  surface  area. 

The  area  of  the  box  is  A  =  2hw  +  2hl  +  Iw,  and  the  volume  is  y  =  Iwh,  so  we  can 
write  the  area  as  a  function  of  two  variables, 

2V  2V 

A{l,w)  =  —  H  +lw. 

I  w 
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Then 


Ai  =  - 


2V 

72" 


+  w    and  A. 


■w 


2V 


w 


If  we  set  these  equal  to  zero  and  solve,  we  find  w 
corresponding  height  is  h  =  V/{2V)'^/^ . 
The  second  derivatives  are 


(2y)^/3  and  /  =  (2V)^/^  and  the 


Au  = 


v4, 


■WW 


4V 


w 


3 


Ai 


.w 


1, 


so  the  discriminant  is 

W  W 

L'  =  -r^-l  =  4-l  =  3>0. 

Since  An  is  2,  there  is  a  local  minimum  at  the  critical  point.  Is  this  a  global  minimum? 
It  is,  but  it  is  difficult  to  see  this  analytically;  physically  and  graphically  it  is  clear  that 
there  is  a  minimum,  in  which  case  it  must  be  at  the  single  critical  point.  Here  is  the  graph 
as  rendered  by  Sage,  as  an  example.  Note  that  we  must  choose  a  value  for  V  in  order  to 
graph  it.  □ 

Recall  that  when  we  did  single  variable  global  maximum  and  minimum  problems,  the 
easiest  cases  were  those  for  which  the  variable  could  be  limited  to  a  finite  closed  interval, 
for  then  we  simply  had  to  check  all  critical  values  and  the  endpoints.  The  previous  example 
is  difficult  because  there  is  no  finite  boundary  to  the  domain  of  the  problem — both  w  and 
/  can  be  in  (0,  00).  As  in  the  single  variable  case,  the  problem  is  often  simpler  when  there 
is  a  finite  boundary. 

THEOREM  14.7.7  If  /(x,  y)  is  continuous  on  a  closed  and  bounded  subset  of  M^,  then 
it  has  both  a  maximum  and  minimum  value.  ■ 

As  in  the  case  of  single  variable  functions,  this  means  that  the  maximum  and  minimum 
values  must  occur  at  a  critical  point  or  on  the  boundary;  in  the  two  variable  case,  however, 
the  boundary  is  a  curve,  not  merely  two  endpoints. 

EXAMPLE  14.7.8  The  length  of  the  diagonal  of  a  box  is  to  be  1  meter;  find  the 
maximum  possible  volume. 

If  the  box  is  placed  with  one  corner  at  the  origin,  and  sides  along  the  axes,  the  length 
of  the  diagonal  is  a/x^  +     +  z^,  and  the  volume  is 


V  =  xyz  =  xy\/l  —  x'^  —  y'^. 
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Clearly,  +  <  1,  so  the  domain  we  are  interested  in  is  the  quarter  of  the  unit  disk  in 
the  first  quadrant.  Computing  derivatives: 


_  y  -  2yx^  -  y 


y 


X  —  2xy'^  —  x^ 
a/1  -  x2  -  y2 


If  these  are  both  0,  then  x  =  Oory  =  0,  orx  =  y  =  l/ \/3.  The  boundary  of  the  domain  is 
composed  of  three  curves:  x  =  0  for  y  G  [0, 1];  y  =  0  for  x  G  [0, 1];  and  x^  +  =  1,  where 
X  >  0  and  y  >  0.  In  all  three  cases,  the  volume  xy^/l  —  x^  —  is  0,  so  the  maximum 
occurs  at  the  only  critical  point  (l/-\/3,  l/y/S,  -\/3/9).  See  figure  14.7.2.  □ 


Figure  14.7.2     The  volume  of  a  box  with  fixed  length  diagonal. 


Exercises  1^.7. 

1.  Find  all  local  maximum 

2.  Find  all  local  maximum 

3.  Find  all  local  maximum 

4.  Find  all  local  maximum 

5.  Find  all  local  maximum 


and  minimum  points  of  / 
and  minimum  points  of  / 
and  minimum  points  of  / 
and  minimum  points  of  / 
and  minimum  points  of  / 


+  4y^  -  2x  +  8y  -  1.  ^ 
a;^  -  +  6x  -  lOy  +  2.  ^ 
xy. 

9  +  4:X-y-2x^  -  3y^.  =^ 
x^  +  4xy  +  y^  —  6y  +  1.  =^> 
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6.  Find  all  local  maximum  and  minimum  points  of  f  =      —  xy  +  2y^  —  5x  +  6y  —  9. 

7.  Find  the  absolute  maximum  and  minimum  points  of  /  =  +  3y  —  3xy  over  the  region 
bounded  by  y  =  x,  y  =  0,  and  x  =  2.  =^> 

8.  A  six-sided  rectangular  box  is  to  hold  1/2  cubic  meter;  what  shape  should  the  box  be  to 
minimize  surface  area?  =^> 

9.  The  post  office  will  accept  packages  whose  combined  length  and  girth  is  at  most  130  inches 
(girth  is  the  maximum  distance  around  the  package  perpendicular  to  the  length).  What  is 
the  largest  volume  that  can  be  sent  in  a  rectangular  box?  =^> 

10.  The  bottom  of  a  rectangular  box  costs  twice  as  much  per  unit  area  as  the  sides  and  top. 
Find  the  shape  for  a  given  volume  that  will  minimize  cost.  =^ 

11.  Using  the  methods  of  this  section,  find  the  shortest  distance  from  the  origin  to  the  plane 
X  +  y  +  z  =  10.  =^ 

12.  Using  the  methods  of  this  section,  find  the  shortest  distance  from  the  point  {xo,yo,  zo)  to 
the  plane  ax  +  by  +  cz  =  d.  You  may  assume  that  c  7^  0;  use  of  Sage  or  similar  software  is 
recommended. 

13.  A  trough  is  to  be  formed  by  bending  up  two  sides  of  a  long  metal  rectangle  so  that  the 
cross-section  of  the  trough  is  an  isosceles  trapezoid,  as  in  figure  6.2.6.  If  the  width  of  the 
metal  sheet  is  2  meters,  how  should  it  be  bent  to  maximize  the  volume  of  the  trough?  =^ 

14.  Given  the  three  points  (1,4),  (5,2),  and  (3,  -2),  (x  -  1)^  +  {y  -  4)^  +  (x  -  5)^  +  (y  -  2)^  + 
(x  —  3)^  +  (y  +  2)^^  is  the  sum  of  the  squares  of  the  distances  from  point  (x,  y)  to  the  three 
points.  Find  x  and  y  so  that  this  quantity  is  minimized.  =^ 

15.  Suppose  that  /(x,  y)  =  x^  +  y^  +  kxy.  Find  and  classify  the  critical  points,  and  discuss  how 
they  change  when  k  takes  on  different  values. 

16.  Find  the  shortest  distance  from  the  point  (0,  6)  to  the  parabola  y  =  x^. 

17.  Find  the  shortest  distance  from  the  point  (0,  0,  b)  to  the  paraboloid  z  =  x'^  +  y'^.  =^ 

18.  Consider  the  function  /(x,  y)  =  x^  —  3x^y  +  y^. 

a.  Show  that  (0,  0)  is  the  only  critical  point  of  /. 

b.  Show  that  the  discriminant  test  is  inconclusive  for  /. 

c.  Determine  the  cross-sections  of  /  obtained  by  setting  y  =  kx  for  various  values  of  k. 

d.  What  kind  of  critical  point  is  (0,  0)? 

19.  Find  the  volume  of  the  largest  rectangular  box  with  edges  parallel  to  the  axes  that  can  be 
inscribed  in  the  ellipsoid  2x^  +  72y^  +  18z^  =  288.  =^ 

14.8    Lagrange  Multipliers 

Many  applied  max/min  problems  take  the  form  of  the  last  two  examples:  we  want  to 
find  an  extreme  value  of  a  function,  like  V  =  xyz,  subject  to  a  constraint,  like  1  = 
-|-  -(-  z^.  Often  this  can  be  done,  as  we  have,  by  explicitly  combining  the  equations 
and  then  finding  critical  points.  There  is  another  approach  that  is  often  convenient,  the 
method  of  Lagrange  multipliers. 
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It  is  somewhat  easier  to  understand  two  variable  problems,  so  we  begin  with  one  as 
an  example.  Suppose  the  perimeter  of  a  rectangle  is  to  be  100  units.  Find  the  rectangle 
with  largest  area.  This  is  a  fairly  straightforward  problem  from  single  variable  calculus. 
We  write  down  the  two  equations:  A  =  xy,  P  =  100  =  2x  +  2y,  solve  the  second  of 
these  for  y  (or  x),  substitute  into  the  first,  and  end  up  with  a  one- variable  maximization 
problem.  Let's  now  think  of  it  differently:  the  equation  A  =  xy  defines  a  surface,  and  the 
equation  100  =  2x  +  2y  defines  a  curve  (a  line,  in  this  case)  in  the  x-y  plane.  If  we  graph 
both  of  these  in  the  three-dimensional  coordinate  system,  we  can  phrase  the  problem  like 
this:  what  is  the  highest  point  on  the  surface  above  the  line?  The  solution  we  already 
understand  effectively  produces  the  equation  of  the  cross-section  of  the  surface  above  the 
line  and  then  treats  it  as  a  single  variable  problem.  Instead,  imagine  that  we  draw  the 
level  curves  (the  contour  lines)  for  the  surface  in  the  x-y  plane,  along  with  the  line. 

50 
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Figure  14.8.1     Constraint  line  with  contour  plot  of  the  surface  xy. 

Imagine  that  the  line  represents  a  hiking  trail  and  the  contour  lines  are,  as  on  a 
topographic  map,  the  lines  of  constant  altitude.  How  could  you  estimate,  based  on  the 
graph,  the  high  (or  low)  points  on  the  path?  As  the  path  crosses  contour  lines,  you  know 
the  path  must  be  increasing  or  decreasing  in  elevation.  At  some  point  you  will  see  the  path 
just  touch  a  contour  line  (tangent  to  it),  and  then  begin  to  cross  contours  in  the  opposite 
order — that  point  of  tangency  must  be  a  maximum  or  minimum  point.  If  we  can  identify 
all  such  points,  we  can  then  check  them  to  see  which  gives  the  maximum  and  which  the 
minimum  value.  As  usual,  we  also  need  to  check  boundary  points;  in  this  problem,  we 
know  that  x  and  y  are  positive,  so  we  are  interested  in  just  the  portion  of  the  line  in  the 
first  quadrant,  as  shown.  The  endpoints  of  the  path,  the  two  points  on  the  axes,  are  not 
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points  of  tangency,  but  they  are  the  two  places  that  the  function  xy  is  a  minimum  in  the 
first  quadrant. 

How  can  we  actuaUy  make  use  of  this?  At  the  points  of  tangency  that  we  seek,  the 
constraint  curve  (in  this  case  the  hne)  and  the  level  curve  have  the  same  slope — their 
tangent  lines  are  parallel.  This  also  means  that  the  constraint  curve  is  perpendicular  to 
the  gradient  vector  of  the  function;  going  a  bit  further,  if  we  can  express  the  constraint 
curve  itself  as  a  level  curve,  then  we  seek  the  points  at  which  the  two  level  curves  have 
parallel  gradients.  The  curve  100  =  2x  +  2y  can  be  thought  of  as  a  level  curve  of  the 
function  2x  +  2y;  figure  14.8.2  shows  both  sets  of  level  curves  on  a  single  graph.  We  are 
interested  in  those  points  where  two  level  curves  are  tangent — but  there  are  many  such 
points,  in  fact  an  infinite  number,  as  we've  only  shown  a  few  of  the  level  curves.  All  along 
the  line  y  =  x  are  points  at  which  two  level  curves  are  tangent.  While  this  might  seem  to 
be  a  show-stopper,  it  is  not. 


Figure  14.8.2     Contour  plots  for  2x  +  2y  and  xy. 

The  gradient  of  2x  +  2y  is  (2,2),  and  the  gradient  of  xy  is  {y,x).  They  are  parallel 
when  (2,  2)  =  A(y,  x),  that  is,  when  2  =  Xy  and  2  =  Ax.  We  have  two  equations  in  three 
unknowns,  which  typically  results  in  many  solutions  (as  we  expected).  A  third  equation  will 
reduce  the  number  of  solutions;  the  third  equation  is  the  original  constraint,  100  =  2x  +  2y. 
So  we  have  the  following  system  to  solve: 


2  =  Xy       2  =  Xx       100  =  2x  +  2y. 
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In  the  first  two  equations,  A  can't  be  0,  so  we  may  divide  by  it  to  get  x  =  y  =  2/\. 
Substituting  into  the  third  equation  we  get 

2  2 

2-  +2-  =  100 


100 

so  X  =  y  =  2b.  Note  that  we  are  not  really  interested  in  the  value  of  A — it  is  a  clever 
tool,  the  Lagrange  multiplier,  introduced  to  solve  the  problem.  In  many  cases,  as  here,  it 
is  easier  to  find  A  than  to  find  everything  else  without  using  A. 

The  same  method  works  for  functions  of  three  variables,  except  of  course  everything 
is  one  dimension  higher:  the  function  to  be  optimized  is  a  function  of  three  variables  and 
the  constraint  represents  a  surface — for  example,  the  function  may  represent  temperature, 
and  we  may  be  interested  in  the  maximum  temperature  on  some  surface,  like  a  sphere. 
The  points  we  seek  are  those  at  which  the  constraint  surface  is  tangent  to  a  level  surface  of 
the  function.  Once  again,  we  consider  the  constraint  surface  to  be  a  level  surface  of  some 
function,  and  we  look  for  points  at  which  the  two  gradients  are  parallel,  giving  us  three 
equations  in  four  unknowns.  The  constraint  provides  a  fourth  equation. 

EXAMPLE  14.8.1     Recall  example  14.7.8:  the  diagonal  of  a  box  is  1,  we  seek  to 
maximize  the  volume.  The  constraint  is  1  =  ^/a;^  +     +  z^,  which  is  the  same  as  1  = 
+     +  z^.  The  function  to  maximize  is  xyz.  The  two  gradient  vectors  are  (2x,  2y,  2z) 
and  {yz,  xz,  xy),  so  the  equations  to  be  solved  are 

yz  =  2xX 
xz  =  2yX 
xy  =  2zX 
I  =  x'^  +  y"^  +  z'^ 

If  A  =  0  then  at  least  two  of  x,  y,  z  must  be  0,  giving  a  volume  of  0,  which  will  not  be  the 
maximum.  If  we  multiply  the  first  two  equations  by  x  and  y  respectively,  we  get 

xyz  =  2x^A 
xyz  =  2y^X 

so  2x^X  =  2y'^X  or  =  y^;  in  the  same  way  we  can  show  =  z"^.  Hence  the  fourth 
equation  becomes  1  =  x^  +  x^  +  or  x  =  1/ v^,  and  sox  =  y  =  z  =  l/ a/3  gives  the 
maximum  volume.  This  is  of  course  the  same  answer  we  obtained  previously.  □ 
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Another  possibility  is  that  we  have  a  function  of  three  variables,  and  we  want  to 
find  a  maximum  or  minimum  value  not  on  a  surface  but  on  a  curve;  often  the  curve 
is  the  intersection  of  two  surfaces,  so  that  we  really  have  two  constraint  equations,  say 
g{x,y,z)  =  ci  and  h{x,y,z)  =  C2-  It  turns  out  that  at  points  on  the  intersection  of  the 
surfaces  where  /  has  a  maximum  or  minimum  value, 

V/  =  AV^  +  nVh. 

As  before,  this  gives  us  three  equations,  one  for  each  component  of  the  vectors,  but  now 
in  five  unknowns,  x,  y,  z,  A,  and  fi.  Since  there  are  two  constraint  functions,  we  have  a 
total  of  five  equations  in  five  unknowns,  and  so  can  usually  find  the  solutions  we  need. 

EXAMPLE  14.8.2  The  plane  x  +  y  —  z  =  1  intersects  the  cyUnder  +  =  1  in  an 
ellipse.  Find  the  points  on  the  ellipse  closest  to  and  farthest  from  the  origin. 

We  want  the  extreme  values  of  /  =  a/x^  +  +  2^  subject  to  the  constraints  g  — 
x"^  +  y'^  =  1  and  h  =  x  +  y  —  z  =  l.  To  simplify  the  algebra,  we  may  use  instead 
f  =  x'^  +  y'^  +  z'^ ,  since  this  has  a  maximum  or  minimum  value  at  exactly  the  points  at 
which  ^y x"^  +  y"^  +  z'^  does.  The  gradients  are 

V/  =  (2x,  2y,  2z)       Vg  =  (2x,  2y,  0)       Vh  =  (1, 1,  -1), 

so  the  equations  we  need  to  solve  are 

2x  =  A2x  +  fi 
2y  =  X2y  +  fi 
2z  =  0-^ 

1  =  + 

1  =  X  +  y  —  z. 

Subtracting  the  first  two  we  get  2y  —  2x  =  \(2y  —  2x),  so  either  A  =  1  or  x  =  y.  If  A  =  1 
then  |U  =  0,  so  z  =  0  and  the  last  two  equations  are 

1  =  x^  +  and       1  =  x  +  y. 

Solving  these  gives  x  =  1,  y  =  0,  or  x  =  0,  y  =  1,  so  the  points  of  interest  are  (1,0,0) 
and  (0, 1,0),  which  are  both  distance  1  from  the  origin.  If  x  =  y,  the  fourth  equation  is 
2x^  =  1,  giving  x  =  y  =  ±l/-\/2,  and  from  the  fifth  equation  we  get  z  =  —1  ±  \/2.  The 
distance  from  the  origin  to  (l/-\/2,  l/-\/2,  —1  +  \f2)  is  a/4  —  2\f2  ^  1.08  and  the  distance 
from  the  origin  to  (-1/^2,  -1/v^,  -1  -  \f2)  is  ^4  +  2^2  ^  2.6.  Thus,  the  points  (1,  0,  0) 
and  (0, 1,0)  are  closest  to  the  origin  and  (— 1/-\/2,  — 1/-\/2,  —1  —  -\/2)  is  farthest  from  the 
origin.  The  Java  applet  shows  the  cylinder,  the  plane,  the  four  points  of  interest,  and  the 
origin.  □ 
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Exercises  14-8. 

1.  A  six-sided  rectangular  box  is  to  hold  1/2  cubic  meter;  what  shape  should  the  box  be  to 
minimize  surface  area?  =^> 

2.  The  post  office  will  accept  packages  whose  combined  length  and  girth  are  at  most  130  inches 
(girth  is  the  maximum  distance  around  the  package  perpendicular  to  the  length).  What  is 
the  largest  volume  that  can  be  sent  in  a  rectangular  box?  =^> 

3.  The  bottom  of  a  rectangular  box  costs  twice  as  much  per  unit  area  as  the  sides  and  top. 
Find  the  shape  for  a  given  volume  that  will  minimize  cost.  =^> 

4.  Using  Lagrange  multipliers,  find  the  shortest  distance  from  the  point  (xo,  yo,  zo)  to  the  plane 
ax  +  by  +  cz  =  d.  =^ 

5.  Find  all  points  on  the  surface  xy  —     -\-  1  =  0  that  are  closest  to  the  origin.  =^ 

6.  The  material  for  the  bottom  of  an  aquarium  costs  half  as  much  as  the  high  strength  glass  for 
the  four  sides.  Find  the  shape  of  the  cheapest  aquarium  that  holds  a  given  volume  V.  =^ 

7.  The  plane  x  —  y  +  z  =  2  intersects  the  cylinder  x'^  +  y'^  =  4  in  an  ellipse.  Find  the  points  on 
the  ellipse  closest  to  and  farthest  from  the  origin. 

8.  Find  three  positive  numbers  whose  sum  is  48  and  whose  product  is  as  large  as  possible. 

9.  Find  all  points  on  the  plane  x  +  y  +  2;  =  5in  the  first  octant  at  which  /(x,  y,  z)  =  xy'^z^  has 
a  maximum  value.  =J> 

10.  Find  the  points  on  the  surface  x^  —  yz  =  5  that  are  closest  to  the  origin. 

11.  A  manufacturer  makes  two  models  of  an  item,  standard  and  deluxe.  It  costs  $40  to  manu- 
facture the  standard  model  and  $60  for  the  deluxe.  A  market  research  firm  estimates  that  if 
the  standard  model  is  priced  at  x  dollars  and  the  deluxe  at  y  dollars,  then  the  manufacturer 
will  sell  500(y  —  x)  of  the  standard  items  and  45,  000  +  500(x  —  2y)  of  the  deluxe  each  year. 
How  should  the  items  be  priced  to  maximize  profit?  =^ 

12.  A  length  of  sheet  metal  27  inches  wide  is  to  be  made  into  a  water  trough  by  bending  up  two 
sides  as  shown  in  figure  14.8.3.  Find  x  and  (f>  so  that  the  trapezoid-shaped  cross  section  has 
maximum  area.  =^> 

X  / 

27  -2x 

Figure  14.8.3     Cross-section  of  a  trough. 

13.  Find  the  maximum  and  minimum  values  of  /(x,  y,  z)  =  6x  +  3y +  2^;  subject  to  the  constraint 
g{x,  y,  z)  =  4x^  +  2y^  +     -  70  =  0.  ^ 

14.  Find  the  maximum  and  minimum  values  of  f{x,y)  =  e^^  subject  to  the  constraint  g{x,y)  = 
x^  +  y^  -  16  =  0. 

15.  Find  the  maximum  and  minimum  values  of  /(x,  y)  =  xy  +  \/9  —  x'^  —  y^  when  x^  +  y'^  <  9. 

16.  Find  three  real  numbers  whose  sum  is  9  and  the  sum  of  whose  squares  is  a  small  as  possible. 

17.  Find  the  dimensions  of  the  closed  rectangular  box  with  maximum  volume  that  can  be  in- 
scribed in  the  unit  sphere. 


15 

Multiple  Integration 


15.1    Volume  and  Average  Height 


Consider  a  surface  f{x,y);  you  might  temporarily  think  of  this  as  representing  physical 
topography — a  hilly  landscape,  perhaps.  What  is  the  average  height  of  the  surface  (or 
average  altitude  of  the  landscape)  over  some  region? 

As  with  most  such  problems,  we  start  by  thinking  about  how  we  might  approximate 
the  answer.  Suppose  the  region  is  a  rectangle,  [a,  6]  x  [c,d].  We  can  divide  the  rectangle 
into  a  grid,  m  subdivisions  in  one  direction  and  n  in  the  other,  as  indicated  in  figure  15.1.1. 
We  pick  X  values  xq,  xi,. . . ,  Xm-i  in  each  subdivision  in  the  x  direction,  and  similarly  in 
the  y  direction.  At  each  of  the  points  {xi,yj)  in  one  of  the  smaller  rectangles  in  the  grid, 
we  compute  the  height  of  the  surface:  /(x^,  yj).  Now  the  average  of  these  heights  should 
be  (depending  on  the  fineness  of  the  grid)  close  to  the  average  height  of  the  surface: 


mn 

As  both  m  and  n  go  to  infinity,  we  expect  this  approximation  to  converge  to  a  fixed 
value,  the  actual  average  height  of  the  surface.  For  reasonably  nice  functions  this  does 
indeed  happen. 

Using  sigma  notation,  we  can  rewrite  the  approximation: 


f{xo,  yo)  +  f{xi,  yo)  H  1-  /(a^o,  Vi)  +  /(a^i,  yi)  H  ^-  /(a^m-i,  Vn-i) 


1 


n—  1  m—  1 


1 


n—1  m  —  1 


b  —  a  d  —  c 


mn 


{h-a){d~c) 


m 


n 


i=0  j=0 


i=0  j=0 


n—  1  m  —  1 


383 


384     Chapter  15  Multiple  Integration 


d 

y2 
yi 

c 


a      Xl     X2     Xs     X4     X5  b 

Figure  15.1.1     A  rectangular  subdivision  of  [a,b]  x  [c,d]. 

The  two  parts  of  this  product  have  useful  meaning:  {b  —  a){d  —  c)  is  of  course  the  area  of 
the  rectangle,  and  the  double  sum  adds  up  mn  terms  of  the  form  f{xj,yi)AxAy,  which  is 
the  height  of  the  surface  at  a  point  times  the  area  of  one  of  the  small  rectangles  into  which 
we  have  divided  the  large  rectangle.  In  short,  each  term  f{xj,  yi)AxAy  is  the  volume  of  a 
tall,  thin,  rectangular  box,  and  is  approximately  the  volume  under  the  surface  and  above 
one  of  the  small  rectangles;  see  figure  15.1.2.  When  we  add  all  of  these  up,  we  get  an 
approximation  to  the  volume  under  the  surface  and  above  the  rectangle  R  =  [a,  b]  x  [c,  d] . 
When  we  take  the  limit  as  m  and  n  go  to  infinity,  the  double  sum  becomes  the  actual 
volume  under  the  surface,  which  we  divide  by  (6  —  a){d  —  c)  to  get  the  average  height. 


Ay 


Ax 


Figure  15.1.2     Approximating  the  volume  under  a  surface. 
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Double  sums  like  this  come  up  in  many  applications,  so  in  a  way  it  is  the  most  impor- 
tant part  of  this  example;  dividing  by  {b  —  a){d  —  c)  is  a  simple  extra  step  that  allows  the 
computation  of  an  average.  As  we  did  in  the  single  variable  case,  we  introduce  a  special 
notation  for  the  limit  of  such  a  double  sum: 


n  —  l  m— 1 


n^oo  ^  ^  f{xj,yi)AxAy  ^  jj  f{x,  y)dxdy  =  jj  f{x,  y)  dA, 


i=0  j=Q 


R  R 


the  double  integral  of  /  over  the  region  R.  The  notation  dA  indicates  a  small  bit  of 
area,  without  specifying  any  particular  order  for  the  variables  x  and  y;  it  is  shorter  and 
more  "generic"  than  writing  dx  dy.  The  average  height  of  the  surface  in  this  notation  is 


R 


(b-a)(d 

The  next  question,  of  course,  is:  How  do  we  compute  these  double  integrals?  You 
might  think  that  we  will  need  some  two-dimensional  version  of  the  Fundamental  Theorem 
of  Calculus,  but  as  it  turns  out  we  can  get  away  with  just  the  single  variable  version, 
applied  twice. 

Going  back  to  the  double  sum,  we  can  rewrite  it  to  emphasize  a  particular  order  in 
which  we  want  to  add  the  terms: 

n  —  l  1  m—1  \ 

i=o  \j=o  I 

In  the  sum  in  parentheses,  only  the  value  of  Xj  is  changing;  yi  is  temporarily  constant.  As 
m  goes  to  infinity,  this  sum  has  the  right  form  to  turn  into  an  integral: 

m—l 


III,     J.  „o 

Jim^  X]  ^^^i'  yi)^x  =  /   f{x,  yi)  dx. 


m 

j=0 


So  after  we  take  the  limit  as  m  goes  to  infinity,  the  sum  is 

n-l  /  „b 


Of  course,  for  different  values  of  y^  this  integral  has  different  values;  in  other  words,  it  is 
really  a  function  applied  to  yf. 


G{y)  =  /  f{x,y)dx. 

J  a 
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If  we  substitute  back  into  the  sum  we  get 


n-l 


This  sum  has  a  nice  interpretation.  The  value  G{yi)  is  the  area  of  a  cross  section  of  the 
region  under  the  surface  f{x,y),  namely,  when  y  =  yi.  The  quantity  G{yi)Ay  can  be 
interpreted  as  the  volume  of  a  solid  with  face  area  G{yi)  and  thickness  Ay.  Think  of  the 
surface  /(x,  y)  as  the  top  of  a  loaf  of  sliced  bread.  Each  slice  has  a  cross-sectional  area  and 
a  thickness;  G{yi)Ay  corresponds  to  the  volume  of  a  single  slice  of  bread.  Adding  these 
up  approximates  the  total  volume  of  the  loaf.  (This  is  very  similar  to  the  technique  we 
used  to  compute  volumes  in  section  9.3,  except  that  there  we  need  the  cross-sections  to  be 
in  some  way  "the  same".)  Figure  15.1.3  shows  this  "sliced  loaf"  approximation  using  the 
same  surface  as  shown  in  figure  15.1.2.  Nicely  enough,  this  sum  looks  just  like  the  sort  of 
sum  that  turns  into  an  integral,  namely. 


Let's  be  clear  about  what  this  means:  we  first  will  compute  the  inner  integral,  temporarily 
treating  y  as  a  constant.  We  will  do  this  by  finding  an  anti-derivative  with  respect  to 
X,  then  substituting  x  =  a  and  x  =  b  and  subtracting,  as  usual.  The  result  will  be  an 
expression  with  no  x  variable  but  some  occurrences  of  y.  Then  the  outer  integral  will  be 
an  ordinary  one-variable  problem,  with  y  as  the  variable. 

EXAMPLE  15.1.1  Figure  15.1.2  shows  the  function  sin{xy)+6/5  on  [0.5,  3.5]  x  [0.5,  2.5]. 
The  volume  under  this  surface  is 


n-l 


d 


i=0 


The  inner  integral  is 


3.5 


cos(xy)  6x 


3.5 


cos(3.5y) 


cos(0.52/)  18 


sm{xy)  +  -  dx 


y  5 


+ 


y  5 


0.5 


0.5 


y 


Unfortunately,  this  gives  a  function  for  which  we  can't  find  a  simple  anti-derivative.  To 
complete  the  problem  we  could  use  Sage  or  similar  software  to  approximate  the  integral. 
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Figure  15.1.3     Approximating  the  volume  under  a  surface  with  slices. 

Doing  this  gives  a  volume  of  approximately  8.84,  so  the  average  height  is  approximately 
8.84/6  ^  1.47.  □ 

Because  addition  and  multiplication  are  commutative  and  associative,  we  can  rewrite 
the  original  double  sum: 

n  —  lm—l  m—ln—l 
i=0  j=0  j=0  i=0 

Now  if  we  repeat  the  development  above,  the  inner  sum  turns  into  an  integral: 


r.  Jc 


dy, 


i=0 

and  then  the  outer  sum  turns  into  an  integral: 

"t-l  /   rd  \  pb  pd 

^^I^oo  ^   \J  ^^j  ^  J  J 

In  other  words,  we  can  compute  the  integrals  in  either  order,  first  with  respect  to  x  then 
y,  or  vice  versa.  Thinking  of  the  loaf  of  bread,  this  corresponds  to  slicing  the  loaf  in  a 
direction  perpendicular  to  the  first. 

We  haven't  really  proved  that  the  value  of  a  double  integral  is  equal  to  the  value  of  the 
corresponding  two  single  integrals  in  either  order  of  integration,  but  provided  the  function 
is  reasonably  nice,  this  is  true;  the  result  is  called  Pubini's  Theorem. 
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EXAMPLE  15.1.2    We  compute  J  J  1  +  (a;  -  1)^  +  4y^  dA,  where  R  =  [0,  3]  x  [0,  2],  in 

R 

two  ways. 
First, 


/    /  l  +  {x-lf +iy''dydx=  [  y  +  {x-lfy  +  ^y 
Jo  Jo  Jo 

,        s9     32  , 
=  /   2  +  2(a;  -  1)2  +  —  rfa; 
Jo  3 


dx 


2,        ,s  32 
=  2a;  +  -(x  -  1)"^  +  —X 


=  6+^-8+|-3-(0-l-^  +  0) 
=  44. 


In  the  other  order: 


2  /.3 


0  ^0 


1  +  (x  -  1)2  +  4?/2  dx  dy 


X  + 


fx -11 


+  Ay^x 


dy 


3  +  -  +  12^2  J,-dy 


-  3y+  -y  +  4y^  +  -y 

=  6+1^  +  32  +  ^ 
3  3 

=  44. 


□ 

In  this  example  there  is  no  particular  reason  to  favor  one  direction  over  the  other; 
in  some  cases,  one  direction  might  be  much  easier  than  the  other,  so  it's  usually  worth 
considering  the  two  different  possibilities. 

Frequently  we  will  be  interested  in  a  region  that  is  not  simply  a  rectangle.  Let's 
compute  the  volume  under  the  surface  x  +  2y'^  above  the  region  described  by  0  <  x  <  1 
and  0  <  y  <  x^,  shown  in  figure  15.1.4. 

In  principle  there  is  nothing  more  difficult  about  this  problem.  If  we  imagine  the  three- 
dimensional  region  under  the  surface  and  above  the  parabolic  region  as  an  oddly  shaped 
loaf  of  bread,  we  can  still  slice  it  up,  approximate  the  volume  of  each  slice,  and  add  these 
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Figure  15.1.4     A  parabolic  region  of  integration. 

volumes  up.  For  example,  if  we  slice  perpendicular  to  the  x  axis  at  Xi,  the  thickness  of  a 
slice  will  be  Ax  and  the  area  of  the  slice  will  be 


Xi  +  2y  dy. 


When  we  add  these  up  and  take  the  limit  as  Ax  goes  to  0,  we  get  the  double  integral 


1  rx 


0  ^0 


X  +  2y  dy  dx 


2  . 
0  <J 


dx 


x^  +  -x^  dx 


x^      I  . 

 1  X 

4  21 


0 

29 


1      2  _ 
4  ^  21  ~  84 


We  could  just  as  well  slice  the  solid  perpendicular  to  the  y  axis,  in  which  case  we  get 


1  /.I 


X  +  2y^  dx  dy 


0 


+  2y^x 


dy 


Vv 


l  +  2y^-^-2y^^dy 


2/^23 

— — y  

2     3^  4 


7/2 


_  1  2  1  4  _  29 
~2^3"4~7^84' 


What  is  the  average  height  of  the  surface  over  this  region?  As  before,  it  is  the  volume 
divided  by  the  area  of  the  base,  but  now  we  need  to  use  integration  to  compute  the  area 
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of  the  base,  since  it  is  not  a  simple  rectangle.  The  area  is 

■3?  dx 


L 


'  1 


3' 

so  the  average  height  is  29/28. 

EXAMPLE  15.1.3  Find  the  volume  under  the  surface  z  =  \/\  —  x"^  and  above  the 
triangle  formed  by  j/  =     x  =  1,  and  the  x-axis. 

Let's  consider  the  two  possible  ways  to  set  this  up: 

/    /    \/\  —  x"^  dydx       or         /     /    \/\  —  x'^  dx  dy. 
Jo  Jo  Jo  Jy 

Which  appears  easier?  In  the  first,  the  first  (inner)  integral  is  easy,  because  we  need  an 

anti-derivative  with  respect  to  y,  and  the  entire  integrand  a/i  —  is  constant  with  respect 
to  y.  Of  course,  the  second  integral  may  be  more  difficult.  In  the  second,  the  first  integral 
is  mildly  unpleasant — a  trig  substitution.  So  let's  try  the  first  one,  since  the  first  step  is 
easy,  and  see  where  that  leaves  us. 

/    /    \/l  —  x'^  dydx  —  /    y\/ 1  —  x"^     dx  —  j    x\/l  —  x^  dx. 
Jo  Jo  Jo  0  Jo 

This  is  quite  easy,  since  the  substitution  u  =  1  —  x^  works: 

J  x\J\-x'^  dx  =  -^J  ^fudu  =  ^u^/^  =  -1(1  -  a;2)3/2. 


Then 


Jo  3 


1 


This  is  a  good  example  of  how  the  order  of  integration  can  affect  the  complexity  of  the 
problem.  In  this  case  it  is  possible  to  do  the  other  order,  but  it  is  a  bit  messier.  In 
some  cases  one  order  may  lead  to  a  very  difficult  or  impossible  integral;  it's  usually  worth 
considering  both  possibilities  before  going  very  far.  □ 
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Exercises  15.1. 

1.  Compute  /     /    \  +  xdydx. 
Jo  Jo 


'0  JO 


2.  Compute  J   J  x  +  ydydx. 

/2  ry 
I    xydxdy.  =^ 


'1  Jo 


4.  Compute  /     /  dxdy. 

Jo  Jy'2/2 

5.  Compute  /     /  —dydx. 
Ji  Ji 

2 


f  r  y 

6.  Compute  /     /     — dydx.  =^> 

Jo  Jo 

7.  Compute  /  /     x  cosy  dydx.  = 

Jo  Jo 

/'7r/2     pcos  9 

8.  Compute  /        /  {cos  6  — r)dr 

Jo  Jo 

9.  Compute:    /     /     \/x'^  +  Idx  dy.  =^ 

Jo  Jyfy 

10.  Compute:    /  /  ysm{x^)dxdy. 

Jo  Jy'^ 

11.  Compute:    /  /    x\/ 1  +  y^  dy  dx  ^ 

Jo  Jx'2 

ri    ry  2 

12.  Compute:    /     /    —^=^=  dx  dy  =^ 

Jo  Jo  \/l-x'^ 


/•I  /-s  ^2 

13.  Compute:    /     /    e'^  dxdy  =^ 

Jo  Jsy 

14.  Compute  /      /         x  —  ^/ydydx. 

J-i  Jo 


15.  Compute  /  /   xdydx.  =^> 

Jo  J -^Jl-2x'^ 

16.  Evaluate  dA  over  the  region  in  the  first  quadrant  bounded  by  the  hyperbola  xy  =  16 


and  the  lines  y  =  x,  y  =  0,  and  x  =  8. 

17.  Find  the  volume  below  z  =  1  —  y  above  the  region  —1  <a;<l,  0<y<l  —  x^.  =^ 

18.  Find  the  volume  bounded  by  z  =  a;^  +  y'^  and  z  =  4.  =^> 

19.  Find  the  volume  in  the  first  octant  bounded  by  y'^  =  4  —  x  and  y  =  2z.  =^ 

20.  Find  the  volume  in  the  first  octant  bounded  by  y^  =  4a;,  2x  +  y  =  4,  z  =  y,  and  y  =  0. 
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21.  Find  the  volume  in  the  first  octant  bounded  hy  x  +  y  +  z  =  9,  2x  +  3y  =  18,  and  x  +  3y  =  9. 

=^ 

22.  Find  the  volume  in  the  first  octant  bounded  by  x'^  +  y'^  =      and  z  =  x  +  y.  =^ 

23.  Find  the  volume  bounded  by  4a;^  +  y'^  =  4z  and  z  =  2.  =^> 

24.  Find  the  volume  bounded  hy  z  =  x^  +  y'^  and  z  =  y. 

25.  Find  the  volume  under  the  surface  z  =  xy  above  the  triangle  with  vertices  (1, 1,  0),  (4, 1,  0), 
(1,2,0).  ^ 

26.  Find  the  volume  enclosed  by  y  =  x'^,  y  =  4,  z  =  x'^,  z  =  0.  ^ 

27.  A  swimming  pool  is  circular  with  a  40  meter  diameter.  The  depth  is  constant  along  east- west 
lines  and  increases  linearly  from  2  meters  at  the  south  end  to  7  meters  at  the  north  end. 
Find  the  volume  of  the  pool.  =^ 

28.  Find  the  average  value  of  f{x,y)  =  \Jx  ^  on  the  rectangle  with  vertices  (0,0),  (4,0), 
(4,1)  and  (0,1).  ^ 

29.  Figure  15.1.5  shows  a  temperature  map  of  Colorado.  Use  the  data  to  estimate  the  average 
temperature  in  the  state  using  4,  16  and  25  subdivisions.  Give  both  an  upper  and  lower 
estimate.  Why  do  we  like  Colorado  for  this  problem?  What  other  state(s)  might  we  like? 


Current  Temperatures  Air  Sports  Net  Weathei 


Figure  15.1.5     Colorado  temperatures. 


30.  Three  cylinders  of  radius  1  intersect  at  right  angles  at  the  origin,  as  shown  in  figure  15.1.6. 
Find  the  volume  contained  inside  all  three  cylinders.  =^ 

31.  Prove  that  if  f{x,y)  is  integrable  and  if  g{x,y)  =         /    f{s,t)  dt  ds  then  gxy  =  Qyx  = 

J  a  Jb 

f{x,y). 

32.  Reverse  the  order  of  integration  on  each  of  the  following  integrals 

a.    /     /  f{x,y)dxdy 
Jo  Jo 

/2      rln  X 
J  f{x,y)dydx 
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15.2    Double  Integrals  in  Cylindrical  Coordinates 

Suppose  we  have  a  surface  given  in  cylindrical  coordinates  as  z  =  f{r,  9)  and  we  wish  to  find 
the  integral  over  some  region.  We  could  attempt  to  translate  into  rectangular  coordinates 
and  do  the  integration  there,  but  it  is  often  easier  to  stay  in  cylindrical  coordinates. 

How  might  we  approximate  the  volume  under  such  a  surface  in  a  way  that  uses  cylin- 
drical coordinates  directly?  The  basic  idea  is  the  same  as  before:  we  divide  the  region  into 
many  small  regions,  multiply  the  area  of  each  small  region  by  the  height  of  the  surface 
somewhere  in  that  little  region,  and  add  them  up.  What  changes  is  the  shape  of  the  small 
regions;  in  order  to  have  a  nice  representation  in  terms  of  r  and  9,  we  use  small  pieces 
of  ring-shaped  shown  in  figure  15.2.1.  Each  small  region  is  roughly  rectangular, 

except  that  two  sides  are  segments  of  a  circle  and  the  other  two  sides  are  not  quite  parallel. 
Near  a  point  {r,9),  the  length  of  either  circular  arc  is  about  rA9  and  the  length  of  each 
straight  side  is  simply  Ar.  When  Ar  and  A^  are  very  small,  the  region  is  nearly  a  rectangle 
with  area  rArA^,  and  the  volume  under  the  surface  is  approximately 


$]$^/(r„^,)r,ArA^. 
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In  the  limit,  this  turns  into  a  double  integral 


00  Jro 


f{r,  9)r  dr  d9. 


Figure  15.2.1     A  cylindrical  coordinates  "grid". 


EXAMPLE  15.2.1  Find  the  volume  under  z  =  \J Al  —  r'^  above  the  quarter  circle 
bounded  by  the  two  axes  and  the  circle      +     =  4  in  the  first  quadrant. 

In  terms  of  r  and  ^,  this  region  is  described  by  the  restrictions  0  <  r  <  2  and  0  <  ^  < 
7r/2,  so  we  have 


/  /  V4-r2  rdrde  =  /  --{A-r^fl'^ 
Jo     Jo  Jo  3 

.7r/2  g 


de 


"  y 

The  surface  is  a  portion  of  the  sphere  of  radius  2  centered  at  the  origin,  in  fact  exactly 
one-eighth  of  the  sphere.  We  know  the  formula  for  volume  of  a  sphere  is  (4/3)7rr^,  so  the 
volume  we  have  computed  is  (l/8)(4/3)7r2"^  =  (4/3)7r,  in  agreement  with  our  answer.  □ 

This  example  is  much  like  a  simple  one  in  rectangular  coordinates:  the  region  of 
interest  may  be  described  exactly  by  a  constant  range  for  each  of  the  variables.  As  with 
rectangular  coordinates,  we  can  adapt  the  method  to  deal  with  more  complicated  regions. 
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EXAMPLE  15.2.2  Find  the  volume  under  z  =  a/4  —  above  the  region  enclosed  by 
the  curve  r  =  2cos^,  —tx/2  <  9  <  n/2;  see  figure  15.2.2.  The  region  is  described  in  polar 
coordinates  by  the  inequalities  —n/2  <  9  <  n/2  and  0  <  r  <  2cos^,  so  the  double  integral 
is 

f.7r/2  /•2cos( 


/7r/2      /•2cos5    /"''r/^  pzcosa 

/         ■\/4  -  r2  rdrd9  =  2  /  r  dr  d9. 

--K/2J0  Jo  Jo 


We  can  rewrite  the  integral  as  shown  because  of  the  symmetry  of  the  volume;  this  avoids 
a  complication  during  the  evaluation.  Proceeding: 


i'Tt/2    j>2cosb  /"'""/^  1 

2/        /         ^A-r^  rdrd9  =  2  /   {A-r'^f/'^ 

Jo      Jo  Jo  3 

sin^  9+-d9 


r/2 


2  cose 


d9 


tt/2 


8  cos^  9 

-< 

3  3 

8 

32 

=  -n 

3 

~  "9"' 

-  cos6'  +  -9 


r/2 


□ 


Figure  15.2.2     Volume  over  a  region  with  non-constant  limits. 

You  might  have  learned  a  formula  for  computing  areas  in  polar  coordinates.  It  is 
possible  to  compute  areas  as  volumes,  so  that  you  need  only  remember  one  technique. 
Consider  the  surface  ^  =  1,  a  horizontal  plane.  The  volume  under  this  surface  and  above 
a  region  in  the  x-y  plane  is  simply  1  •  (area  of  the  region),  so  computing  the  volume  really 
just  computes  the  area  of  the  region. 
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EXAMPLE  15.2.3  Find  the  area  outside  the  circle  r  =  2  and  inside  r  =  4sin^;  see 
figure  15.2.3.  The  region  is  described  byyr/G  <  6  <  Stt/G  and  2  <  r  <  4sin^,  so  the 
integral  is 


□ 


Figure  15.2.3     Finding  area  by  computing  volume. 

Exercises  15.2. 

1.  Find  the  volume  above  the  x-y  plane,  under  the  surface      =  2z,  and  inside  r  =  2.  =^ 

2.  Find  the  volume  inside  both  r  =  1  and  r'^  +      =  A.  =^ 

3.  Find  the  volume  below  z  =  \/l  —  r'^  and  above  the  top  half  of  the  cone  z  =  r.  =^> 

4.  Find  the  volume  below  z  =  r,  above  the  x-y  plane,  and  inside  r  =  cosO.  =^> 

5.  Find  the  volume  below  z  =  r,  above  the  x-y  plane,  and  inside  r  =  1  +  cos  9. 

6.  Find  the  volume  between  x  +  y  =  z   and  x  +  y  =  z.  =^> 

7.  Find  the  area  inside  r  =  1  +  sin  9  and  outside  r  =  2sm9. 

8.  Find  the  area  inside  both  r  =  2sm9  and  r  =  2cos0.  =^ 

9.  Find  the  area  inside  the  four-leaf  rose  r  =  cos(2^)  and  outside  r  =  1/2.  =^ 

10.  Find  the  area  inside  the  cardioid  r  =  2(1  +  cos9)  and  outside  r  =  2.  =^ 

11.  Find  the  area  of  one  loop  of  the  three-leaf  rose  r  =  cos(30).  =^ 

12.  Compute  /      /  sin(x  -\- y  )  dy  da;  by  converting  to  cylindrical  coordinates.  =^ 

>/-3  ^0 

r  f°  2 

13.  Compute  /     /  x  y  dy  da;  by  converting  to  cylindrical  coordinates.  =^> 

Jo     J -^a'^-x.'^ 
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14.  Find  the  volume  under  z  =      +  x  +  2  above  the  region      +      <  4  =^ 

15.  Find  the  volume  between  z  =  x^y^  and  z  =  1  above  the  region      +  y'^  <  1  =^ 

16.  Find  the  volume  inside  x'^  +  y"^  =  1  and  x'^  +  z'^  =  1. 

17.  Find  the  volume  under  z  =  r  above  r  =  3  +  cos  6. 

18.  Figure  15.2.4  shows  the  plot  of  r  =  1  +  4sin(50). 


Figure  15.2.4     r  =  1  +  4sin(56') 

a.  Describe  the  behavior  of  the  graph  in  terms  of  the  given  equation.  Specifically,  explain 
maximum  and  minimum  values,  number  of  leaves,  and  the  'leaves  within  leaves'. 

b.  Give  an  integral  or  integrals  to  determine  the  area  outside  a  smaller  leaf  but  inside  a 
larger  leaf. 

c.  How  would  changing  the  value  of  a  in  the  equation  r  =  1  +  acos(50)  change  the  relative 
sizes  of  the  inner  and  outer  leaves?  Focus  on  values  a  >  1.  (Hint:  How  would  we  change 
the  maximum  and  minimum  values?) 

19.  Consider  the  integral  /  /  — .  dA,  where  D  is  the  unit  disk  centered  at  the  origin.  (See 

JJ  ^x^  +  y2 

D  ^ 

the  graph  here.) 

a.  Why  might  this  integral  be  considered  improper? 

b.  Calculate  the  value  of  the  integral  of  the  same  function  1/^/x'^  +  y'^  over  the  annulus 
with  outer  radius  1  and  inner  radius  5. 

c.  Obtain  a  value  for  the  integral  on  the  whole  disk  by  letting  6  approach  0.  =^ 

d.  For  which  values  A  can  we  replace  the  denominator  with  (x^+y"^)^  in  the  original  integral? 
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Using  a  single  integral  we  were  able  to  compute  the  center  of  mass  for  a  one- dimensional 
object  with  variable  density,  and  a  two  dimensional  object  with  constant  density.  With  a 
double  integral  we  can  handle  two  dimensions  and  variable  density. 
Just  as  before,  the  coordinates  of  the  center  of  mass  are 
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where  M  is  the  total  mass,  My  is  the  moment  around  the  y-axis,  and  Mj;  is  the  moment 
around  the  x-axis.  (You  may  want  to  review  the  concepts  in  section  9.6.) 

The  key  to  the  computation,  just  as  before,  is  the  approximation  of  mass.  In  the  two- 
dimensional  case,  we  treat  density  a  as  mass  per  square  area,  so  when  density  is  constant, 
mass  is  (density) (area).  If  we  have  a  two-dimensional  region  with  varying  density  given 
by  (t{x,  y),  and  we  divide  the  region  into  small  subregions  with  area  AA,  then  the  mass  of 
one  subregion  is  approximately  a{xi,yj)AA,  the  total  mass  is  approximately  the  sum  of 
many  of  these,  and  as  usual  the  sum  turns  into  an  integral  in  the  limit: 

M  =         /     a{x,y)  dydx, 

JxQ  Jyo 

and  similarly  for  computations  in  cylindrical  coordinates.  Then  as  before 

r^i  ryi 
=         /     ycr{x,y)  dydx 

Jxf)  Jyo 

r^i  rVi 
~  /      /  xa{x,y)dydx. 
J^o  Jyo 


EXAMPLE  15.3.1  Find  the  center  of  mass  of  a  thin,  uniform  plate  whose  shape  is 
the  region  between  y  =  cosx  and  the  x-axis  between  x  =  —n/2  and  x  =  tt/2.  Since  the 
density  is  constant,  we  may  take  a(x,  y)  =  1. 

It  is  clear  that  x  =  0,  but  for  practice  let's  compute  it  anyway.  First  we  compute  the 
mass: 

■k/2     fcosx  A'7r/2 
/        /  /    /  ^2 


M  =  I        /        ldydx=  cosxfix  =  sinx|_^  ,2  =  2. 

I-/2  Jo  J-n/2 


Next, 


»7r/2     y-cosx  /.7r/2   ^  ^ 

=  I        /       y  dydx  =  -  cos^  x  dx 


Finally, 


/2  Jo  J-Tr/2  2  4 

/7r/2  /-cosx 
/       xdydx=  xcosxdx  —  0. 

-n/2  Jo  J-n/2 

So  X  =  0  as  expected,  and  y  =  tv/4/2  =  tv/8.  This  is  the  same  problem  as  in  example  9.6.4; 
it  may  be  helpful  to  compare  the  two  solutions.  □ 

EXAMPLE  15.3.2  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies 
the  quarter  circle  x^  -|-      <  1  in  the  first  quadrant  and  has  density  /c(x^  +  y^).  It  seems 
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clear  that  because  of  the  symmetry  of  both  the  region  and  the  density  function  (both  are 
important!),  x  =  y.  We'U  do  both  to  check  our  work. 
Jumping  right  in: 

M=  I     I  k{x'^  +  y'^)  dy  dx  =  k      x'^Vl-x'^  +  -  — dx. 

Jo  Jo  Jo  3 

This  integral  is  something  we  can  do,  but  it's  a  bit  unpleasant.  Since  everything  in  sight 
is  related  to  a  circle,  let's  back  up  and  try  polar  coordinates.  Then  x'^  +     =  and 


7r/2    fl  1-77/2  ^4 


1  /-/2  1  ^ 

de  =  k      -de  =  k-. 

0  4  8 


M  =  /    k{r^)rdrd6  =  k 

Jo  Jo 

Much  better.  Next,  since  y  =  rsin^, 

M^  =  k  /       sine  dr  dO  =  k         -  sin 9  d9  =  k  —  cos  6' 


r/2 


k 

5' 


Similarly, 


r/2     .1  .7^/2  ^  ^ 

My  =  k  I       I   r^cosOdrde  =  k  I       -  cosO  dO  =  k  -  sinO 
'0      Jo  Jo 


5  5 


tt/2 


k 

5' 


g 

Finally,  x  =  y  =  — .  □ 
Exercises  15.3. 

1.  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  square  [0, 1]  x  [0, 1]  and 
has  density  function  xy.  =^ 

2.  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  triangle  0  <  x  <  1, 
0  <  y  <  a;,  and  has  density  function  xy. 

3.  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  upper  unit  semicircle 
centered  at  (0,  0)  and  has  density  function  y.  =^ 

4.  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  upper  unit  semicircle 
centered  at  (0,0)  and  has  density  function  x^ .  =^ 

5.  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  triangle  formed  by 
X  =  2,  y  =  X,  and  y  =  2x  and  has  density  function  2x. 

6.  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  triangle  formed  by 
X  =  0,  y  =  X,  and  2x  +  y  =  6  and  has  density  function  x'^ .  =^> 

7.  Find  the  center  of  mass  of  a  two-dimensional  plate  that  occupies  the  region  enclosed  by  the 
parabolas  x  =     ,  y  =  x^  and  has  density  function  y/x.  =^ 
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8.  Find  the  centroid  of  the  area  in  the  first  quadrant  bounded  by  —  8y  +  4  =  0,  x'^  =  4y,  and 
X  =  0.  (Recall  that  the  centroid  is  the  center  of  mass  when  the  density  is  f  everywhere.) 

9.  Find  the  centroid  of  one  loop  of  the  three- leaf  rose  r  =  cos(30).  (Recall  that  the  centroid  is 
the  center  of  mass  when  the  density  is  1  everywhere,  and  that  the  mass  in  this  case  is  the 
same  as  the  area,  which  was  the  subject  of  exercise  11  in  section  15.2.)  The  computations  of 
the  integrals  for  the  moments  Mx  and  My  are  elementary  but  quite  long;  Sage  can  help. 

10.  Find  the  center  of  mass  of  a  two  dimensional  object  that  occupies  the  region  0  <  x  <  tt, 
0  <  y  <  sinx,  with  density  a  =  1.  =^ 

11.  A  two-dimensional  object  has  shape  given  by  r  =  1  +  cos9  and  density  a{r,6)  =  2  +  cos9. 
Set  up  the  three  integrals  required  to  compute  the  center  of  mass.  =^ 

12.  A  two-dimensional  object  has  shape  given  by  r  =  cosO  and  density  a{r,9)  =  r  +  1.  Set  up 
the  three  integrals  required  to  compute  the  center  of  mass.  =^> 

13.  A  two-dimensional  object  sits  inside  r  =  1  +  cos  9  and  outside  r  =  cos  9,  and  has  density  1 
everywhere.  Set  up  the  integrals  required  to  compute  the  center  of  mass.  =^> 


We  next  seek  to  compute  the  area  of  a  surface  above  (or  below)  a  region  in  the  x-y  plane. 
How  might  we  approximate  this?  We  start,  as  usual,  by  dividing  the  region  into  a  grid 
of  small  rectangles.  We  want  to  approximate  the  area  of  the  surface  above  one  of  these 
small  rectangles.  The  area  is  very  close  to  the  area  of  the  tangent  plane  above  the  small 
rectangle.  If  the  tangent  plane  just  happened  to  be  horizontal,  of  course  the  area  would 
simply  be  the  area  of  the  rectangle.  For  a  typical  plane,  however,  the  area  is  the  area  of 
a  parallelogram,  as  indicated  in  figure  15.4.1.  Note  that  the  area  of  the  parallelogram  is 
obviously  larger  the  more  "tilted"  the  tangent  plane.  In  the  Java  applet  you  can  see  that 
viewed  from  above  the  four  parallelograms  exactly  cover  a  rectangular  region  in  the  x-y 
plane. 

Now  recall  a  curious  fact:  the  area  of  a  parallelogram  can  be  computed  as  the  cross 
product  of  two  vectors  (page  315).  We  simply  need  to  acquire  two  vectors,  parallel  to 
the  sides  of  the  parallelogram  and  with  lengths  to  match.  But  this  is  easy:  in  the  x 
direction  we  use  the  tangent  vector  we  already  know,  namely  (1,  0,  fx)  and  multiply  by  Ax 
to  shrink  it  to  the  right  size:  (Ax,  0,  fx  Ax).  In  the  y  direction  we  do  the  same  thing  and 
get  {0,  Ay,  fyAy) .  The  cross  product  of  these  vectors  is  {fx,fy,—l)AxAy  with  length 


Y +     +  1  the  area  of  the  parallelogram.  Now  we  add  these  up  and  take  the 

limit,  to  produce  the  integral 


15.4    Surface  Area 


As  before,  the  limits  need  not  be  constant. 
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Figure  15.4.1     Small  parallelograms  at  points  of  tangency. 

EXAMPLE  15.4.1  We  find  the  area  of  tfie  liemispliere  z  =  a/1  —  —  .  We  compute 
tlie  derivatives 

-y 


fx, 


-X 


'1  -  X2  -  2/2 


fx 


1  —  x2  —  J/2 


and  tlien  tlie  area  is 


1  rVi- 


+ 


1  —  x2  —  ?/2       \  —       —  y 


+  ldy  dx. 


-1  J -^/l-x^ 

Tliis  is  a  bit  on  tlie  messy  side,  but  we  can  use  polar  coordinates: 

r.27r  /.I 


0  JO 


1 


1  -  r2 


r  dr  d9. 


This  integral  is  improper,  since  the  function  is  undefined  at  the  limit  1.  We  therefore 
compute 


lim 


1 


1  —  r" 


r  dr  =  lim  — vl  — a2  +  l  =  l. 


using  the  substitution  u  =  1  —  r'^ .  Then  the  area  is 

f.27r 

1  de  =  2iT. 


You  may  recall  that  the  area  of  a  sphere  of  radius  r  is  47rr^,  so  half  the  area  of  a  unit 
sphere  is  (l/2)47r  =  2n,  in  agreement  with  our  answer.  □ 
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Exercises  15.4- 

1.  Find  the  area  of  the  surface  of  a  right  circular  cone  of  height  h  and  base  radius  a. 

2.  Find  the  area  of  the  portion  of  the  plane  z  =  mx  inside  the  cylinder      -\-  y'^  =  a^. 

3.  Find  the  area  of  the  portion  of  the  plane  x  +  y  +  z  =  lm  the  first  octant.  =^ 

4.  Find  the  area  of  the  upper  half  of  the  cone  x  +y   =  z   inside  the  cylinder  x  +y  —2x  =  0. 

=^ 

5.  Find  the  area  of  the  upper  half  of  the  cone  x'^  +  y'^  =  z'^  above  the  interior  of  one  loop  of 
r  =  cos(26').  =^ 

6.  Find  the  area  of  the  upper  hemisphere  of  x'^  +     +     =  1  above  the  interior  of  one  loop  of 
r  =  cos(26').  =^ 

7.  The  plane  ax  +  by  +  cz  =  d  cuts  a  triangle  in  the  first  octant  provided  that  a,  b,  c  and  d  are 
all  positive.  Find  the  area  of  this  triangle.  =^ 

8.  Find  the  area  of  the  portion  of  the  cone  x'^  +  y"^  =  2>z^  lying  above  the  xy  plane  and  inside 
the  cylinder  x^  ^y^  —  4y. 


15.5    Triple  Integrals 

It  will  come  as  no  surprise  that  we  can  also  do  triple  integrals — integrals  over  a  three- 
dimensional  region.  The  simplest  application  allows  us  to  compute  volumes  in  an  alternate 
way. 

To  approximate  a  volume  in  three  dimensions,  we  can  divide  the  three-dimensional 
region  into  small  rectangular  boxes,  each  Ax  x  Ay  x  Az  with  volume  AxAyA^;.  Then  we 
add  them  all  up  and  take  the  limit,  to  get  an  integral: 

I      I  dzdydx. 

Of  course,  if  the  limits  are  constant,  we  are  simply  computing  the  volume  of  a  rectangular 
box. 

EXAMPLE  15.5.1  We  use  an  integral  to  compute  the  volume  of  the  box  with  opposite 
corners  at  (0,0,0)  and  (1,2,3). 

/     /     /   dzdydx=        /    z\Qdydx=        /   3dydx=       Sy]"^  dx  =  /   6dx  =  6. 
Jo  Jo  Jo  Jo  Jo  Jo  Jo  Jo  Jo 

□ 

Of  course,  this  is  more  interesting  and  useful  when  the  limits  are  not  constant. 

EXAMPLE  15.5.2    Find  the  volume  of  the  tetrahedron  with  corners  at  (0,  0,  0) ,  (0,  3,  0) , 

(2,3,0),  and  (2,3,5). 
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The  whole  problem  comes  down  to  correctly  describing  the  region  by  inequalities: 
0<x<2,  3x/2<y<3,  0<2;<5x/2.  The  lower  y  limit  comes  from  the  equation  of 
the  line  y  =  3x/2  that  forms  one  edge  of  the  tetrahedron  in  the  x-y  plane;  the  upper  z 
limit  comes  from  the  equation  of  the  plane  z  =  5x/2  that  forms  the  "upper"  side  of  the 
tetrahedron;  see  figure  15.5.1.  Now  the  volume  is 


2     /.3  p5x/2 

0  J3X/2  Jo 


dzdydx=  /     /  zl^^^'^dydx 

Jo  J-ix/2 


2 


hx 


dy  dx 


0  J2,x/2 

hx 


dx 


3x/2 


^  15x  15x2 


dx 


0 


15x2  15x'^ 


4  12 
=  15  -  10  =  5. 


□ 

Pretty  much  just  the  way  we  did  for  two  dimensions  we  can  use  triple  integration  to 
compute  mass,  center  of  mass,  and  various  average  quantities. 

EXAMPLE  15.5.3  Suppose  the  temperature  at  a  point  is  given  by  T  =  xyz.  Find  the 
average  temperature  in  the  cube  with  opposite  corners  at  (0,  0,  0)  and  (2,  2,  2). 

In  two  dimensions  we  add  up  the  temperature  at  "each"  point  and  divide  by  the  area; 
here  we  add  up  the  temperatures  and  divide  by  the  volume,  8: 


1 


0  ~'o  ^0 


xyz  dz  dy  dx 


0  JO 


xyz 


dy  dx 


xy^ 


dx 


1 

16 

4iX'  ditjc 


xy  dy  dx 


0  JO 

1  x2 

2  y 


1. 


□ 


EXAMPLE  15.5.4  Suppose  the  density  of  an  object  is  given  by  xz,  and  the  object 
occupies  the  tetrahedron  with  corners  (0,0,0),  (0,1,0),  (1,1,0),  and  (0,1,1).  Find  the 
mass  and  center  of  mass  of  the  object. 
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Figure  15.5.1     A  tetrahedron. 
As  usual,  the  mass  is  the  integral  of  density  over  the  region: 

M  =  /     /     /       xzdzdydx=        /      ^        ^   dy  dx  =  -  /      ^        ^  c/x 
io        io  Jo  Jx         ^  2  Jo  3 

—  -       X  —  3x^  +  3x^  —  x^  dx  =   . 

6  io  120 

We  compute  moments  as  before,  except  now  there  is  a  third  moment: 


1    /"I  py—x 


0   Jx  Jo 


■^xy  =111  dz  dy  dx 


1 


360' 


>1     r-l  ry-x 

Mxz  =  /     /    /  dz  dy  dx 


0    Jx  JO 

r-l      r-l  l-y-X 

Myz  =  /    /    /       ^'^^  dz  dy  dx 

Jo    Jx  Jo 


144' 
1 


Finally,  the  coordinates  of  the  center  of  mass  are  x  =  My^/M  =  1/3,  y  =  M^^/M  =  5/6, 
and  z  =  M^y/M  =  1/3.  □ 
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Exercises  15.5. 

pI     nx  rx+y 

1.  Evaluate  j     j     j        2x  +  y  —  1  dz  dy  dx. 

Jo  Jo  Jo 

l>2      r>x^  ny 

2.  Evaluate  /     /      /    xyz  dz  dy  dx. 

Jo  J -1  Ji 

n\     px     n\n  y 

3.  Evaluate  /    /    /      e'^^^'"  dz  dy  dx.  =^ 

Jo  Jo  Jo 

/'7r/2     ps'iwO  prcosO 

4.  Evaluate  /        /        /  dzdr  dO. 

Jo      Jo  Jo 

5.  Evaluate  /     /         /         r  cos^  6  dz  dr  dO . 


Jo   Jo  Jo 

1     /.y^  px+y 


6.  Evaluate  /     /      /       xdzdxdy.  =^> 

Jo  Jo  Jo 

ry^  /■ln(i/+z) 

7.  Evaluate  /     /      /  dxdz  dy.  =^> 

Ji  Jy  Jo 

rn      PIT /2  nl 

8.  Compute  /     /        /    z  sinx  +  z  cosy  dz  dy  dx. 


JO    Jo  JO 

9.  For  each  of  the  integrals  in  the  previous  exercises,  give  a  description  of  the  volume  (both 
algebraic  and  geometric)  that  is  the  domain  of  integration. 

10.  Compute  /   /      x  +  y  +  z  dV  over  the  region  inside  a;'^  +  y'^  +  2;^  <  1  in  the  first  octant. 


11.  Find  the  mass  of  a  cube  with  edge  length  2  and  density  equal  to  the  square  of  the  distance 
from  one  corner. 

12.  Find  the  mass  of  a  cube  with  edge  length  2  and  density  equal  to  the  square  of  the  distance 
from  one  edge.  =^ 

13.  An  object  occupies  the  volume  of  the  upper  hemisphere  oi  x'^  +  y'^  +  z^  =  A  and  has  density 
z  at  {x,y,z).  Find  the  center  of  mass.  =^ 

14.  An  object  occupies  the  volume  of  the  pyramid  with  corners  at  (1, 1,  0),  (1,  —1,  0),  (—1,  —1,  0), 
(—1, 1,  0),  and  (0,  0,  2)  and  has  density  x^  +  y^  at  {x,  y,  z).  Find  the  center  of  mass.  =^> 

15.  Verify  the  moments  Mxy,  Mxz,  and  Myz  of  example  15.5.4  by  evaluating  the  integrals. 

16.  Find  the  region  E  for  which  JJJ ~      ~      ~       d,V  is  a  maximum. 

E 


15.6    Cylindrical  and  Spherical  Coordinates 

We  have  seen  that  sometimes  double  integrals  are  simphfied  by  doing  them  in  polar  coordi- 
nates; not  surprisingly,  triple  integrals  are  sometimes  simpler  in  cylindrical  coordinates  or 
spherical  coordinates.  To  set  up  integrals  in  polar  coordinates,  we  had  to  understand  the 
shape  and  area  of  a  typical  small  region  into  which  the  region  of  integration  was  divided. 
We  need  to  do  the  same  thing  here,  for  three  dimensional  regions. 
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The  cylindrical  coordinate  system  is  the  simplest,  since  it  is  just  the  polar  coordinate 
system  plus  a  z  coordinate.  A  typical  small  unit  of  volume  is  the  shape  shown  in  fig- 
ure 15.2.1  "fattened  up"  in  the  z  direction,  so  its  volume  is  rArA^A^,  or  in  the  limit, 
r  dr  d6  dz. 

EXAMPLE  15.6.1    Find  the  volume  under  z  =  a/4  —     above  the  quarter  circle  inside 
+     =  4  in  the  first  quadrant. 
We  could  of  course  do  this  with  a  double  integral,  but  we'll  use  a  triple  integral: 


/     /  r  dz  dr  dO  =  /    y  A  —  r'^  r  dr  dO  =  — . 

'0      Jo  Jo  Jo      Jo  3 

Compare  this  to  example  15.2.1.  □ 

EXAMPLE  15.6.2    An  object  occupies  the  space  inside  both  the  cylinder  x'^  +     =  1 
and  the  sphere  x'^  +  y'^  +  z'^  =  4,  and  has  density  x'^  at  (x,  y,  z).  Find  the  total  mass. 
We  set  this  up  in  cylindrical  coordinates,  recalling  that  x  =  r  cos  9: 

/    /  r^  cos'^ (6)  dz  dr  de  =  /      /   2y^A-r^  r^  cos^ {9)  dr  dO 

0      Jo    J -^A-r^  Jo  Jo 

128     22  r-\ 

 TT 

15     5  y 

□ 

Spherical  coordinates  are  somewhat  more  difficult  to  understand.  The  small  volume 
we  want  will  be  defined  by  Ap,  A(^,  and  A^,  as  pictured  in  figure  15.6.1.  To  gain  a  better 
understanding,  see  the  Java  applet.  The  small  volume  is  nearly  box  shaped,  with  4  fiat 
sides  and  two  sides  formed  from  bits  of  concentric  spheres.  When  Ap,  Ac/),  and  /S.9  are  all 
very  small,  the  volume  of  this  little  region  will  be  nearly  the  volume  we  get  by  treating  it 
as  a  box.  One  dimension  of  the  box  is  simply  Ap,  the  change  in  distance  from  the  origin. 
The  other  two  dimensions  are  the  lengths  of  small  circular  arcs,  so  they  are  rAa  for  some 
suitable  r  and  a,  just  as  in  the  polar  coordinates  case. 

The  easiest  of  these  to  understand  is  the  arc  corresponding  to  a  change  in  cf),  which 
is  nearly  identical  to  the  derivation  for  polar  coordinates,  as  shown  in  the  left  graph  in 
figure  15.6.2.  In  that  graph  we  are  looking  "face  on"  at  the  side  of  the  box  we  are  interested 
in,  so  the  small  angle  pictured  is  precisely  A(/),  the  vertical  axis  really  is  the  z  axis,  but 
the  horizontal  axis  is  not  a  real  axis — it  is  just  some  line  in  the  x-y  plane.  Because  the 
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Figure  15.6.1     A  small  unit  of  volume  for  spherical  coordinates. 


Figure  15.6.2     Setting  up  integration  in  spherical  coordinates. 


other  arc  is  governed  by  9,  we  need  to  imagine  looking  straight  down  the  z  axis,  so  that 
the  apparent  angle  we  see  is  A^.  In  this  view,  the  axes  really  are  the  x  and  y  axes.  In  this 
graph,  the  apparent  distance  from  the  origin  is  not  p  but  psincp,  as  indicated  in  the  left 
graph. 

The  upshot  is  that  the  volume  of  the  little  box  is  approximately  Ap{pA(p)  [p  sin  (pAO)  = 
p^  sin0ApA(^A^,  or  in  the  limit     sin  (p  dp  d(pd9. 

EXAMPLE  15.6.3  Suppose  the  temperature  at  (x,  y,  z)  is  T  =  1/(1  +  +  +  z"^). 
Find  the  average  temperature  in  the  unit  sphere  centered  at  the  origin. 
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In  two  dimensions  we  add  up  the  temperature  at  "each"  point  and  divide  by  the  area; 
here  we  add  up  the  temperatures  and  divide  by  the  volume,  (4/3)7r: 


1  — nyJl  —  X^  —  y"^ 


dz  dy  dx 


This  looks  quite  messy;  since  everything  in  the  problem  is  closely  related  to  a  sphere,  we'll 
convert  to  spherical  coordinates. 

/     /   sin  (p  dp  d(pd6  = — (47r  —  tt^)  =  3  . 

Jo  io   1  +  P  47r  4 


□ 


Exercises  15.6. 


0    JO  JO 


1.  Evaluate  III  ^  ^  -dzdydx. 


+  y^  +  z 


I  I  \/ x"^  +  y"^  -\-  z"^  dz  dy  dx. 

-1  Jo  J  Jx'^+v'^ 


3.  Evaluate  /   /   /  a;^  dV  over  the  interior  of  the  cylinder  x^  +y'^  =  1  between  z  =  0  and  z  =  5. 


4.  Evaluate  I   I   I  xy  dV  over  the  interior  of  the  cylinder  x'^  +y'^  =  1  between  z  =  0  and  z  =  5 


5.  Evaluate  J  J  J  over  the  region  above  the  x-y  plane,  inside  x  +y  —2x  =  0  and  under 

+     +     =  4.  =^ 


6.  Evaluate  J  J  J  over  the  region  in  the  first  octant,  inside  x'^  +  y^  —  2x  =  0  and  under 

x'^  +  y'^  +  z^  =  4. 


7.  Evaluate  J  J  J  x  +  y  dV  over  the  interior  of  a;  +y  +  z   =4.  = 

8.  Evaluate  /  /  /  ^/x'^Ty^  dV  over  the  interior  of  x'^  +  y'^  +  z'^  =  4. 


9.  Compute  /    /      x  +  y  +  z  dV  over  the  region  inside  x  +y  +  z   <lin  the  first  octant 


10.  Find  the  mass  of  a  right  circular  cone  of  height  h  and  base  radius  a  if  the  density  is  propor- 
tional to  the  distance  from  the  base.  =^ 

11.  Find  the  mass  of  a  right  circular  cone  of  height  h  and  base  radius  a  if  the  density  is  propor- 
tional to  the  distance  from  its  axis  of  symmetry. 

12.  An  object  occupies  the  region  inside  the  unit  sphere  at  the  origin,  and  has  density  equal  to 
the  distance  from  the  x-axis.  Find  the  mass.  =^ 
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13.  An  object  occupies  the  region  inside  the  unit  sphere  at  the  origin,  and  has  density  equal  to 
the  square  of  the  distance  from  the  origin.  Find  the  mass.  =^> 

14.  An  object  occupies  the  region  between  the  unit  sphere  at  the  origin  and  a  sphere  of  radius 
2  with  center  at  the  origin,  and  has  density  equal  to  the  distance  from  the  origin.  Find  the 
mass.  =^ 

15.  An  object  occupies  the  region  in  the  first  octant  bounded  by  the  cones  (j)  =  tt/A  and  4>  = 
arctan2,  and  the  sphere  p  =  \/6,  and  has  density  proportional  to  the  distance  from  the 
origin.  Find  the  mass.  =^ 

15.7    Change  of  Variables 

One  of  the  most  useful  techniques  for  evaluating  integrals  is  substitution,  both  "•u-substitu- 
tion"  and  trigonometric  substitution,  in  which  we  change  the  variable  to  something  more 
convenient.  As  we  have  seen,  sometimes  changing  from  rectangular  coordinates  to  another 
coordinate  system  is  helpful,  and  this  too  changes  the  variables.  This  is  certainly  a  more 
complicated  change,  since  instead  of  changing  one  variable  for  another  we  change  an  entire 
suite  of  variables,  but  as  it  turns  out  it  is  really  very  similar  to  the  kinds  of  change  of 
variables  we  already  know  as  substitution. 


Figure  15.7.1     Single  change  of  variable. 

Let's  examine  the  single  variable  case  again,  from  a  slightly  different  perspective  than 
we  have  previously  used.  Suppose  we  start  with  the  problem 


this  computes  the  area  in  the  left  graph  of  figure  15.7.1.  We  use  the  substitution  x  =  sinu 
to  transform  the  function  from  x^a/1  —  to  sin^  u^/l  —  sin^  w,  and  we  also  convert  dx  to 
cos  udu.  Finally,  we  convert  the  limits  0  and  1  to  0  and  n/2.  This  transforms  the  integral: 

/•I    r^/s  .  .  

/   x^a/I  — a;^(ix=  /      sin^  uV  1  —  sin^  u  cos  u  du. 
Jo  Jo 

We  want  to  notice  that  there  are  three  different  conversions:  the  main  function,  the  differ- 
ential dx,  and  the  interval  of  integration.  The  function  is  converted  to  sin^  u^/l  —  sin^  w. 
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shown  in  the  right-hand  graph  of  figure  15.7.1.  It  is  evident  that  the  two  curves  pictured 
there  have  the  same  y- values  in  the  same  order,  but  the  horizontal  scale  has  been  changed. 
Even  though  the  heights  are  the  same,  the  two  integrals 


are  not  the  same;  clearly  the  right  hand  area  is  larger.  One  way  to  understand  the  prob- 
lem is  to  note  that  if  both  areas  are  approximated  using,  say,  ten  subintervals,  that  the 
approximating  rectangles  on  the  right  are  wider  than  their  counterparts  on  the  left,  as  in- 
dicated. In  the  picture,  the  width  of  the  rectangle  on  the  left  is  Ax  =  0.1,  between  0.7  and 
0.8.  The  rectangle  on  the  right  is  situated  between  the  corresponding  values  arcsin(0.7) 
and  arcsin(0.8)  so  that  Au  =  arcsin(0.8)  —  arcsin(0.7).  To  make  the  widths  match,  and 
the  areas  therefore  the  same,  we  can  multiply  Au  by  a  correction  factor;  in  this  case  the 
correction  factor  is  approximately  cosw  =  cos(arcsin(0.7)),  which  we  compute  when  we 
convert  dx  to  cos  udu. 

Now  let's  move  to  functions  of  two  variables.  Suppose  we  want  to  convert  an  integral 


to  use  new  variables  u  and  v.  In  the  single  variable  case,  there's  typically  just  one  reason 
to  want  to  change  the  variable:  to  make  the  function  "nicer"  so  that  we  can  find  an 
antiderivative.  In  the  two  variable  case,  there  is  a  second  potential  reason:  the  two- 
dimensional  region  over  which  we  need  to  integrate  is  somehow  unpleasant,  and  we  want 
the  region  in  terms  of  u  and  v  to  be  nicer — to  be  a  rectangle,  for  example.  Ideally,  of 
course,  the  new  function  and  the  new  region  will  be  no  worse  than  the  originals,  and  at 
least  one  of  them  will  be  better;  this  doesn't  always  pan  out. 

As  before,  there  are  three  parts  to  the  conversion:  the  function  itself  must  be  rewritten 
in  terms  of  u  and  f ,  dydx  must  be  converted  to  du  dv,  and  the  old  region  must  be  converted 
to  the  new  region.  We  will  develop  the  necessary  techniques  by  considering  a  particular 
example,  and  we  will  use  an  example  we  already  know  how  to  do  by  other  means. 


J-i  Jo 

The  limits  correspond  to  integrating  over  the  top  half  of  a  circular  disk,  and  we  recognize 
that  the  function  will  simplify  in  polar  coordinates,  so  we  would  normally  convert  to  polar 


and 


Consider 


15.7    Change  of  Variables  411 


coordinates: 


r  dr  d9 


TT 

3' 


But  let's  instead  approach  this  as  a  substitution  problem,  starting  with  x  =  rcos9,  y  — 
rsin^.  This  pair  of  equations  describes  a  function  from  "r-6'  space"  to  "x-y  space",  and 
because  it  involves  familiar  concepts,  it  is  not  too  hard  to  understand  what  it  does.  In 
figure  15.7.2  we  have  indicated  geometrically  a  bit  about  how  this  function  behaves.  The 
four  dots  labeled  a-d  in  the  r-9  plane  correspond  to  the  three  dots  in  the  x-y  plane;  dots  a 
and  h  both  go  to  the  origin  because  r  =  0.  The  horizontal  arrow  in  the  r-9  plane  has  r  =  1 
everywhere  and  9  ranges  from  0  to  tt,  so  the  corresponding  points  x  =  rcos9,  y  =  rsin^ 
start  at  (1,  0)  and  follow  the  unit  circle  counter-clockwise.  Finally,  the  vertical  arrow  has 
9  —  7^/4  and  r  ranges  from  0  to  1,  so  it  maps  to  the  straight  arrow  in  the  x-y  plane. 
Extrapolating  from  these  few  examples,  it's  not  hard  to  see  that  every  vertical  line  in  the 
r-9  plane  is  transformed  to  a  line  through  the  origin  in  the  x-y  plane,  and  every  horizontal 
line  in  the  r-9  plane  is  transformed  to  a  circle  with  center  at  the  origin  in  the  x-y  plane. 
Since  we  are  interested  in  integrating  over  the  half-disk  in  the  x-y  plane,  we  will  integrate 
over  the  rectangle  [0,  tt]  x  [0, 1]  in  the  r-9  plane,  because  we  now  see  that  the  points  in  this 
rectangle  are  sent  precisely  to  the  upper  half  disk  by  x  =  rcos9  and  y  =  rsin^. 


0 


TT 


Figure  15.7.2     Double  change  of  variable. 


At  this  point  we  are  two-thirds  done  with  the  task:  we  know  the  r-9  limits  of  integra- 
tion, and  we  can  easily  convert  the  function  to  the  new  variables: 

a/x^  -f     =  V cos^  ^  +     sin^  9  =  r\/ cos^  9  +  sin^  9  =  r.  (15.7.1) 

The  final,  and  most  difficult,  task  is  to  figure  out  what  replaces  dxdy.  (Of  course,  we 
actually  know  the  answer,  because  we  are  in  effect  converting  to  polar  coordinates.  What 
we  really  want  is  a  series  of  steps  that  gets  to  that  right  answer  but  that  will  also  work  for 
other  substitutions  that  are  not  so  familiar.) 

Let's  take  a  step  back  and  remember  how  integration  arises  from  approximation.  When 
we  approximate  the  integral  in  the  x-y  plane,  we  are  computing  the  volumes  of  tall  thin 
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boxes,  in  this  case  boxes  that  are  Ax  x  Ay  x  a/x^  +  y"^.  We  are  aiming  to  come  up  with 
an  integral  in  the  r-6  plane  that  looks  like  this: 


0  ^0 


rmdrdO. 


(15.7.2) 


What  we're  missing  is  exactly  the  right  quantity  to  replace  the  "?"  so  that  we  get  the 
correct  answer.  Of  course,  this  integral  is  also  the  result  of  an  approximation,  in  which  we 
add  up  volumes  of  boxes  that  are  Ar  x  A^  x  height;  the  problem  is  that  the  height  that 
will  give  us  the  correct  answer  is  not  simply  r.  Or  put  another  way,  we  can  think  of  the 
correct  height  as  r,  but  the  area  of  the  base  ArA^  as  being  wrong.  The  height  r  comes 
from  equation  15.7.1,  which  is  to  say,  it  is  precisely  the  same  as  the  corresponding  height 
in  the  x-y  version  of  the  integral.  The  problem  is  that  the  area  of  the  base  Ax  x  Ay  is  not 
the  same  as  the  area  of  the  base  Ar  x  A^.  We  can  think  of  the  "?"  in  the  integral  as  a 
correction  factor  that  is  needed  so  that  1  dr  dO  =  dx  dy. 

So  let's  think  about  what  that  little  base  Ar  x  A^  corresponds  to.  We  know  that  each 
bit  of  horizontal  line  in  the  r-9  plane  corresponds  to  a  bit  of  circular  arc  in  the  x-y  plane, 
and  each  bit  of  vertical  line  in  the  r-9  plane  corresponds  to  a  bit  of  "radial  line"  in  the  x-y 
plane.  In  figure  15.7.3  we  show  a  typical  rectangle  in  the  r-9  plane  and  its  corresponding 
area  in  the  x-y  plane. 


Ar 


Ae 


TT 


y 
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Figure  15.7.3     Corresponding  areas. 


In  this  case,  the  region  in  the  x-y  plane  is  approximately  a  rectangle  with  dimensions 
Ar  X  rA^,  but  in  general  the  corner  angles  will  not  be  right  angles,  so  the  region  will 
typically  be  (almost)  a  parallelogram.  We  need  to  compute  the  area  of  this  parallelogram. 
We  know  a  neat  way  to  do  this:  compute  the  length  of  a  certain  cross  product  (page  315). 
If  we  can  determine  an  appropriate  two  vectors  we'll  be  nearly  done. 

Fortunately,  we've  really  done  this  before.  The  sides  of  the  region  in  the  x-y  plane 
are  formed  by  temporarily  fixing  either  r  or  9  and  letting  the  other  variable  range  over  a 
small  interval.  In  figure  15.7.3,  for  example,  the  upper  right  edge  of  the  region  is  formed 
by  fixing  9  =  27r/3  and  letting  r  run  from  0.5  to  0.75.  In  other  words,  we  have  a  vector 
function  v(r)  =  (r  cos  6*0,  r  sin  6*0,  0),  and  we  are  interested  in  a  restricted  set  of  values 
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for  r.  A  vector  tangent  to  this  path  is  given  by  the  derivative  v'(r)  =  (cos  6*0,  sin  6*0,  0), 
and  a  smaU  tangent  vector,  with  length  approximately  equal  to  the  side  of  the  region,  is 
(cos  6*0,  sin  ^0,  0)  (ir.  Likewise,  if  we  fix  r  =  ro  =  0.5,  we  get  the  vector  function  mv{9)  = 
(ro  cos  6*,  ro  sin  6*,  0)  with  derivative  mv{9)  =  {—tq  sin  6*,  ro  cos  6*,  0)  and  a  small  tangent  vector 
(—ro  sin 6*0,  ro  cos  6*0,  0)  (i6'  when  9  =  9q  (at  the  corner  we're  focusing  on).  These  vectors 
are  shown  in  figure  15.7.4,  with  the  actual  region  outlined  by  a  dotted  boundary.  Of 
course,  since  both  Ar  and  are  quite  large,  the  parallelogram  is  not  a  particularly  good 
approximation  to  the  true 


T 
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X 


-10  1 
Figure  15.7.4     The  approximating  parallelogram. 


The  area  of  this  parallelogram  is  the  length  of  the  cross  product: 
(— ro  sin  6*0,  ro  cos  9o,  0)  d9  x  (cos  6*0,  sin  6*0,  0)  dr  = 


i  j  k 

-ro  sin  6*0    ro  cos  9o    0    d9  dr 
cos  6*0        sin  9o  0 

(0,  0,  — ro  sin^  9o  —  ro  cos^  9o)  d9  dr. 


The  length  of  this  vector  is  ro  dr  d9.  So  in  general,  for  any  values  of  r  and  9,  the  area 
in  the  x-y  plane  corresponding  to  a  small  rectangle  anchored  at  {9,  r)  in  the  r-9  plane  is 
approximately  rdrd9.  In  other  words,  "r"  replaces  the  "?"  in  equation  15.7.2. 

In  general,  a  substitution  will  start  with  equations  x  =  f(u,  v)  and  y  =  g{u,  v).  Again, 
it  will  be  straightforward  to  convert  the  function  being  integrated.  Converting  the  limits 
will  require,  as  above,  an  understanding  of  just  how  the  functions  /  and  g  transform  the 
u-v  plane  into  the  x-y  plane.  Finally,  the  small  vectors  we  need  to  approximate  an  area  will 
be  {fuiQui^)  du  and  {fy,  gy,0)  dv.  The  cross  product  of  these  is  (0,  0, —  g^fy)  dudv 
with  length  \fugv  —  gufv  \  dudv.  The  quantity  \fugv  —  gufv\  is  usually  denoted 

d{x,y) 
d{u,  v) 

and  called  the  Jacobian.  Note  that  this  is  the  absolute  value  of  the  two  by  two  determinant 


\fu9v  9ufv 


fu 
fv 


9u 
9v 
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which  may  be  easier  to  remember.  (Confusingly,  the  matrix,  the  determinant  of  the  matrix, 
and  the  absolute  value  of  the  determinant  are  all  called  the  Jacobian  by  various  authors.) 

Because  there  are  two  things  to  worry  about,  namely,  the  form  of  the  function  and 
the  region  of  integration,  transformations  in  two  (or  more)  variables  are  quite  tricky  to 
discover. 

EXAMPLE  15.7.1     Integrate      —  xy         over  the  region  x'^  —  xy  +  y^  <  2. 

The  equation  x'^  —  xy  +  y'^  =  2  describes  an  ellipse  as  in  figure  15.7.5;  the  region  of 
integration  is  the  interior  of  the  ellipse.  We  will  use  the  transformation  x  —  y/2u  —  A/2/3f , 
y  =  y/2u  +  A/2/3f .  Substituting  into  the  function  itself  we  get 

x'^  -xy  +  y'^  =  2u^  +2v^. 

The  boundary  of  the  ellipse  is  x"^  —  xy  +  y'^  <  2,  so  the  boundary  of  the  corresponding 
region  in  the  u-v  plane  is  2w^  +  2f  ^  =  2  or  tt^  +  f  ^  =  1,  the  unit  circle,  so  this  substitution 
makes  the  region  of  integration  simpler. 

Next,  we  compute  the  Jacobian,  using  /  =  V2u  -  ^/2/3v  and  g  =  V2u  +  ^/2/3v: 


fu9v  -  9ufv  =  v^\/273  +  V2^/2/3  = 


Hence  the  new  integral  is 


where  R  is  the  interior  of  the  unit  circle.  This  is  still  not  an  easy  integral,  but  it  is  easily 
transformed  to  polar  coordinates,  and  then  easily  integrated.  □ 


Figure  15.7.5     x'^  -  xy  +      =  2 
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There  is  a  similar  change  of  variables  formula  for  triple  integrals,  though  it  is  a  bit 
more  difficult  to  derive.  Suppose  we  use  three  substitution  functions,  x  =  f{u,v,w), 
y  =  g{u,  V,  w),  and  z  =  h{u,  v,  w).  The  Jacobian  determinant  is  now 


fu  9u 
fv  9v 
fw    9w  hn 


Then  the  integral  is  transformed  in  a  similar  fashion: 


F{x,y,z)  dV 


R 


F{f{u,  V,  w),  g{u,  V,  w),h{u,  v,  w)) 


d{x,y,z) 


d{u,  V,  w) 


du  dv  dw. 


where  of  course  the  region  5"  in  uvw  space  corresponds  to  the  region  R  in  xyz  space. 
Exercises  15.7. 

1.  Complete  example  15.7.1  by  converting  to  polar  coordinates  and  evaluating  the  integral.  =^ 

2.  Evaluate  jj  xydxdy  over  the  square  with  corners  (0,0),  (1,1),  (2,0),  and  (1,-1)  in  two 

ways:  directly,  and  using  x  =  {u  +  v)/2,  y  =  iu  —  v)/2. 

3.  Evaluate  j j  x^  +  y^  dx  dy  over  the  square  with  corners  (  —  1,  0),  (0, 1),  (1,  0),  and  (0,  —1)  in 

two  ways:  directly,  and  using  x  =  {u  +  v)/2,  y  =  (u  —  v)/2.  =^ 

4.  Evaluate  JJ {x  +  y)e^^^  dx  dy  over  the  triangle  with  corners  (0,0),  (—1, 1),  and  (1, 1)  in  two 

ways:  directly,  and  using  x  =  {u  +  v)/2,  y  =  {u  —  v)/2.  =^> 

5.  Evaluate  JJ  y{x  —  y)  dx  dy  over  the  parallelogram  with  corners  (0,  0),  (3,  3),  (7,  3),  and  (4,  0) 

in  two  ways:  directly,  and  using  x  =  u  +  v,  y  =  u.  =^ 

6.  Evaluate   /  /  \/ x'^  +  y^  dx  dy  over  the  triangle  with  corners  (0,0),  (4,4),  and  (4,0)  using 


X  =  u,  y  =  uv.  =^ 

7.  Evaluate  J  J  ysm{xy)  dxdy  over  the  region  bounded  by  xy  =  1,  xy  =  4,  y  =  1,  and  y  =  4 
using  X  =  u/v,  y  =  V. 

8.  Evaluate  //  sin(9a;^  +  4y^)  d^,  over  the  region  in  the  first  quadrant  bounded  by  the  ellipse 


9x^  +  4y2  =  1.  ^ 

9.  Compute  the  Jacobian  for  the  substitutions  x  =  psinc^cos^,  y  =  psin^sin^,  z  =  pcosc 
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10.  Evaluate  /  /  /  dV  where  E  is  the  solid  enclosed  by  the  ellipsoid 


2  2  2 

X  y       z  ^ 

 \-  - — I  =  1, 

-.2  7^2  /i2 


using  the  transformation  x  =  au,  y  =  bv,  and  z  =  cw. 


16 

Vector  Calculus 


16.1    Vector  Fields 

This  chapter  is  concerned  with  applying  calculus  in  the  context  of  vector  fields.  A 
two-dimensional  vector  field  is  a  function  /  that  maps  each  point  (x,  y)  in  to  a  two- 
dimensional  vector  {u,v),  and  similarly  a  three-dimensional  vector  field  maps  {x,y,z)  to 
{u,v,w).  Since  a  vector  has  no  position,  we  typically  indicate  a  vector  field  in  graphical 
form  by  placing  the  vector  f{x,y)  with  its  tail  at  {x^y).  Figure  16.1.1  shows  a  represen- 
tation of  the  vector  field  /(x,  y)  =  {x/ ^/x'^  +  y'^  +  4,  —y/ a/x^  +  +  4).  For  such  a  graph 
to  be  readable,  the  vectors  must  be  fairly  short,  which  is  accomplished  by  using  a  different 
scale  for  the  vectors  than  for  the  axes.  Such  graphs  are  thus  useful  for  understanding  the 
sizes  of  the  vectors  relative  to  each  other  but  not  their  absolute  size. 

Vector  fields  have  many  important  applications,  as  they  can  be  used  to  represent  many 
physical  quantities:  the  vector  at  a  point  may  represent  the  strength  of  some  force  (gravity, 
electricity,  magnetism)  or  a  velocity  (wind  speed  or  the  velocity  of  some  other  fiuid). 

We  have  already  seen  a  particularly  important  kind  of  vector  field — the  gradient.  Given 
a  function  f{x,y),  recall  that  the  gradient  is  {fx{x,y),fy{x,y)),  a  vector  that  depends  on 
(is  a  function  of)  x  and  y.  We  usually  picture  the  gradient  vector  with  its  tail  at  (x,y), 
pointing  in  the  direction  of  maximum  increase.  Vector  fields  that  are  gradients  have  some 
particularly  nice  properties,  as  we  will  see.  An  important  example  is 

F=/  Zl  \ 

\  (x2  -f  y2  _(_  ^2^/2  '  (3,2  +  y2  _^  ^2^3/2  '  (3,2  +  y2  _^  ^2^/2  /  ' 
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Figure  16.1.1     A  vector  field, 
which  points  from  the  point  (x,     z)  toward  the  origin  and  has  length 


\fxW^]pW-^ 


1 


(^2  +  ^2  +  ^2)3/2        (^^2  +  ^2+^2)2  ' 

which  is  the  reciprocal  of  the  square  of  the  distance  from  (x,  z)  to  the  origin — in  other 
words,  F  is  an  "inverse  square  law" .  The  vector  F  is  a  gradient: 


1 


(16.1.1) 


which  turns  out  to  be  extremely  useful. 


Exercises  16.1. 


Sketch  the  vector  fields;  check  your  work  with  Sage's  plot_vector_f  ield  function. 


1. 
2. 
3. 
4. 
5. 
6. 


{x,y) 
{-X,  -y) 
{x,  -y) 
(sin a;,  cosy) 

{y,  i/a;) 

(x  +  1,  X  +  3) 


7.  Verify  equation  16.1.1. 
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16.2    Line  Integrals 

We  have  so  far  integrated  "over"  intervals,  areas,  and  volumes  with  single,  double,  and 
triple  integrals.  We  now  investigate  integration  over  or  "along"  a  curve — "line  integrals" 
are  really  "curve  integrals" . 

As  with  other  integrals,  a  geometric  example  may  be  easiest  to  understand.  Consider 
the  function  f  =  x  +  y  and  the  parabola  y  =  in  the  x-y  plane,  for  0  <  x  <  2.  Imagine 
that  we  extend  the  parabola  up  to  the  surface  /,  to  form  a  curved  wall  or  curtain,  as  in 
figure  16.2.1.  What  is  the  area  of  the  surface  thus  formed?  We  already  know  one  way  to 
compute  surface  area,  but  here  we  take  a  different  approach  that  is  more  useful  for  the 
problems  to  come. 


Figure  16.2.1     Approximating  the  area  under  a  curve. 

As  usual,  we  start  by  thinking  about  how  to  approximate  the  area.  We  pick  some 
points  along  the  part  of  the  parabola  we're  interested  in,  and  connect  adjacent  points  by 
straight  lines;  when  the  points  are  close  together,  the  length  of  each  line  segment  will  be 
close  to  the  length  along  the  parabola.  Using  each  line  segment  as  the  base  of  a  rectangle, 
we  choose  the  height  to  be  the  height  of  the  surface  /  above  the  line  segment.  If  we  add 
up  the  areas  of  these  rectangles,  we  get  an  approximation  to  the  desired  area,  and  in  the 
limit  this  sum  turns  into  an  integral. 

Typically  the  curve  is  in  vector  form,  or  can  easily  be  put  in  vector  form;  in  this 
example  we  have  v(t)  =  Then  as  we  have  seen  in  section  13.3  on  arc  length, 

the  length  of  one  of  the  straight  line  segments  in  the  approximation  is  approximately 
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ds  =  \v'\dt  =  Vl  +      dt,  so  the  integral  is 

2  2 

^  f{t,t'Wl  +  4t^dt  =  it  +  t')^/l^dt=^V^7-^-^H4  +  Vl7). 
This  integral  of  a  function  along  a  curve  C  is  often  written  in  abbreviated  form  as 

f{x,y)  ds. 


EXAMPLE  16.2.1    Compute  /  ye""  ds  where  C  is  the  line  segment  from  (1,  2)  to  (4,  7). 

Jc 

We  write  the  line  segment  as  a  vector  function:  v  =  (1,2)  + 1(3,5),  0  <  t  <  1,  or  in 
parametric  form  x  =  1  +  3t,  y  =  2  +  5t.  Then 

/  ye''ds=  /\2  +  5t)e^+^*A/32T52dt=  ^v^e^- ^v^e. 
Jc  Jo  9  9 

□ 

All  of  these  ideas  extend  to  three  dimensions  in  the  obvious  way. 

EXAMPLE  16.2.2     Compute  /  x'^zds  where  C  is  the  line  segment  from  (0,  6,  -1)  to 

Jc 

(4,1,5). 

We  write  the  line  segment  as  a  vector  function:  v  =  (0,  6,  —1)  +  t(4,  —5,  6),  0  <  t  <  1, 
or  in  parametric  form  x  —  4t,  y  =  6  —  5t,  z  =  —1  +  6t.  Then 

/  x^zds=  [  (4t)2(-l  +  6t)  Vl6  +  25  +  36     =  16^77  /  -t'^  +  6t^  dt  =  ^Vff. 
Jc  Jo  Jo  3 

□ 

Now  we  turn  to  a  perhaps  more  interesting  example.  Recall  that  in  the  simplest  case, 
the  work  done  by  a  force  on  an  object  is  equal  to  the  magnitude  of  the  force  times  the 
distance  the  object  moves;  this  assumes  that  the  force  is  constant  and  in  the  direction  of 
motion.  We  have  already  dealt  with  examples  in  which  the  force  is  not  constant;  now  we 
are  prepared  to  examine  what  happens  when  the  force  is  not  parallel  to  the  direction  of 
motion. 
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We  have  already  examined  the  idea  of  components  of  force,  in  example  12.3.4:  the 
component  of  a  force  F  in  the  direction  of  a  vector  v  is 


F  V 

-V 


V 


2 


the  projection  of  F  onto  v.  The  length  of  this  vector,  that  is,  the  magnitude  of  the  force 
in  the  direction  of  v,  is 

F  V 


the  scalar  projection  of  F  onto  v.  If  an  object  moves  subject  to  this  (constant)  force,  in 
the  direction  of  v,  over  a  distance  equal  to  the  length  of  v,  the  work  done  is 

— — |v|  =  F  ■  V. 

|v| 

Thus,  work  in  the  vector  setting  is  still  "force  times  distance" ,  except  that  "times"  means 
"dot  product". 

If  the  force  varies  from  point  to  point,  it  is  represented  by  a  vector  field  F;  the  dis- 
placement vector  V  may  also  change,  as  an  object  may  follow  a  curving  path  in  two  or 
three  dimensions.  Suppose  that  the  path  of  an  object  is  given  by  a  vector  function  r{t);  at 
any  point  along  the  path,  the  (small)  tangent  vector  r'  At  gives  an  approximation  to  its 
motion  over  a  short  time  At,  so  the  work  done  during  that  time  is  approximately  F  ■  r'  At; 
the  total  work  over  some  time  period  is  then 

ti 

F  ■  r'  dt. 

to 

It  is  useful  to  rewrite  this  in  various  ways  at  different  times.  We  start  with 

F-r'  dt=  I  F  •  dr, 

to  Jc 

abbreviating  r'  dt  by  dr.  Or  we  can  write 

tl  fti  I  rti  /■ 

F-r'dt=  /    F-—\r'\dt=  /    F-T\r'\dt=  /  F-Tds, 


to  J  to  \    \  J  to  JC 

using  the  unit  tangent  vector  T,  abbreviating  |r'|  dt  as  ds,  and  indicating  the  path  of  the 
object  by  C.  In  other  words,  work  is  computed  using  a  particular  line  integral  of  the  form 
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we  have  considered.  Alternately,  we  sometimes  write 


f  dx  +  g  dy  +  hdz  =  /  f  dx  +  /  gdy+  /  hdz, 
c  Jc  Jc  Jc 

and  similarly  for  two  dimensions,  leaving  out  references  to  z. 

EXAMPLE  16.2.3  Suppose  an  object  moves  from  (—1,1)  to  (2,4)  along  the  path 
r{t)  =  {t,t^),  subject  to  the  force  F  =  {xsiny^y).  Find  the  work  done. 

We  can  write  the  force  in  terms  of  t  as  (t  sin(t^),  t-^),  and  compute  r'{t)  =  (1,  2t),  and 
then  the  work  is 


/2  /•2 
{tsm{t'^),t^)  ■  {l,2t)dt  =  J    tsm{t'^)  +  2t^  dt  =  —  + 

Alternately,  we  might  write 


15     cos(l)  -  cos(4) 


2 


•       J  f      ^  r       ■    r  ^  C0S(4)       COs(l)       16  1 

xsmydx+  /  y  dy  =  /    xsm(x  )  dx  +  /    y  dy  =  1  1  

getting  the  same  answer.  □ 

Exercises  16.2. 

1.  Compute  /  xy^  ds  along  the  line  segment  from  (1,2,0)  to  (2, 1,3). 

Jc 

2.  Compute  /  sinxds  along  the  line  segment  from  (—1,2,1)  to  (1,2,5). 

Jc 

3.  Compute  /  zcos{xy)ds  along  the  line  segment  from  (1,0,1)  to  (2,2,3).  =^> 

Jc 

4.  Compute  /  sinxdx  +  cosydy  along  the  top  half  of  the  unit  circle,  from  (1,  0)  to  (—1,  0).  =^> 

Jc 

6.  Compute  /  xe^  dx  +  x^ydy  along  the  line  segment  x  =  4,  0<y<4.  =^ 
Jc 

8.  Compute  /  xe^  dx  +  x^ydy  along  the  curve  (e*,  e*),  —1  <  ^  <  1.  =^ 

Jc 

9.  Compute  /  (cos x, sin y)  •  dr  along  the  curve  {t,t),  0  <  t  <  1.  =^> 

Jc 
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10.  Compute  /  {l/xy,  l/{x  +  y))  ■  dv  along  the  path  from  (1, 1)  to  (3, 1)  to  (3,  6)  using  straight 

Jc 

Une  segments.  =^> 

11.  Compute  /  (l/a;y,         +  y))  •  dr  along  the  curve  (2t,  5t),  1  <  t  <  4.  =^ 

Jc 

12.  Compute  /  (l/a;y,         +  y))  •  dr  along  the  curve 

Jc 

14.  Compute  /  yzdx  +  xz  dy  +  xy  dz  along  the  curve  (cost,  sint,  tant),  0  <  t  <  tt.  =^ 

Jc 

15.  An  object  moves  from  (1,1)  to  (4,8)  along  the  path  r{t)  =  {t^,t^),  subject  to  the  force 
F  =  (a;'^,siny).  Find  the  work  done.  =^> 

16.  An  object  moves  along  the  line  segment  from  (1,1)  to  (2,5),  subject  to  the  force  F  = 
{x/{x^  +  y^),y / {x^  +  y'^)) .  Find  the  work  done. 

17.  An  object  moves  along  the  parabola  r(t)  =  {t,t'^),  0  <  t  <  1,  subject  to  the  force  F  = 
(l/(y  +  1),  — l/(x  +  1)).  Find  the  work  done.  =^ 

18.  An  object  moves  along  the  line  segment  from  (0,0,0)  to  (3,6, 10),  subject  to  the  force  F  = 
{x'^ ,y'^ ,  z'^).  Find  the  work  done. 

19.  An  object  moves  along  the  curve  r{t)  =  (Vt,  1/Vi,t)  1  <  t  <  i,  subject  to  the  force  F  = 
{y,z,x).  Find  the  work  done.  =J> 

20.  An  object  moves  from  (1, 1, 1)  to  (2,  4,  8)  along  the  path  r{t)  =  {t,  t'^,t^),  subject  to  the  force 
F  =  (sin  X,  sin  y,  sin  z).  Find  the  work  done.  =^> 

21.  An  object  moves  from  (1,0,0)  to  (— l,0,7r)  along  the  path  r{t)  =  (cos  t,  sint,  t),  subject  to 
the  force  F  =  {y"^  ,y'^  ,xz) .  Find  the  work  done. 

22.  Give  an  example  of  a  non-trivial  force  field  F  and  non-trivial  path  r(t)  for  which  the  total 
work  done  moving  along  the  path  is  zero. 

16.3    The  Fundamental  Theorem  of  Line  Integrals 

One  way  to  write  the  Fundamental  Theorem  of  Calculus  (7.2.1)  is: 


f  f'{x)  dx  =  f{b)-f{a) 

J  a 


That  is,  to  compute  the  integral  of  a  derivative  /'  we  need  only  compute  the  values  of  / 
at  the  endpoints.  Something  similar  is  true  for  line  integrals  of  a  certain  form. 

THEOREM  16.3.1  Fundamental  Theorem  of  Line  Integrals  Suppose  a  curve 
C  is  given  by  the  vector  function  r(t),  with  a  =  r(a)  and  b  =  r(6).  Then 


V/-rfr  =  /(b)-/(a), 


c 


provided  that  r  is  sufficiently  nice. 
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Proof.  We  write  r  =  {x{t),y{t),  z{t)) ,  so  that  r'  =  {x' {t) ,  y' (t) ,  z' (t)) .  Also,  we  know 
that  V/ =  Then 

Vf-dr=      {f^jyj,)-{x'{t),y'{t),z'{t))dt=       f^x' +  fyy' +  f,z' dt. 

J  a  J  a 


C 


By  the  chain  rule  (see  section  14.4)  /^^x'  +  fyy'  +  f^z'  =  df  /dt,  where  /  in  this  context 
means  f{x{t),  y{t),  z{t)),  a  function  of  t.  In  other  words,  all  we  have  is 

-b 


[  f'{t)dt  =  f{b)-f{a). 

J  a 


In  this  context,  /(a)  =  f{x{a),  y{a),  z{a)).  Since  a  —  r(a)  =  (x(a),  y(a),  z{a)),  we  can  write 
/(a)  =  /(a) — this  is  a  bit  of  a  cheat,  since  we  are  simultaneously  using  /  to  mean  f{t)  and 
f{x,y,z),  and  since  f{x{a),y{a),z{a))  is  not  technically  the  same  as  f{{x{a),y{a),z{a))), 
but  the  concepts  are  clear  and  the  different  uses  are  compatible.  Doing  the  same  for  b,  we 
get 

/  Vf-dr=  [  fit)  dt  =  fib)  -  fia)  =  /(b)  -  /(a).  . 

This  theorem,  like  the  Fundamental  Theorem  of  Calculus,  says  roughly  that  if  we 
integrate  a  "derivative-like  function"  (/'  or  V/)  the  result  depends  only  on  the  values  of 
the  original  function  (/)  at  the  endpoints. 

If  a  vector  field  F  is  the  gradient  of  a  function,  F  =  V/,  we  say  that  F  is  a  conserva- 
tive vector  field.  If  F  is  a  conservative  force  field,  then  the  integral  for  work,  F  ■  dr, 
is  in  the  form  required  by  the  Fundamental  Theorem  of  Line  Integrals.  This  means  that 
in  a  conservative  force  field,  the  amount  of  work  required  to  move  an  object  from  point  a 
to  point  b  depends  only  on  those  points,  not  on  the  path  taken  between  them. 

EXAMPLE  16.3.2     An  object  moves  in  the  force  field 

—X  —y  —z 


+  y2  _(_  ^2^3/2  '  (^^2  +  y2      ^2)3/2  '  (^^2  +  y2  _^  ^2)3/2 

along  the  curve  r  =  (1  + 1,  t"^,  t  cos(7rt))  as  t  ranges  from  0  to  1.  Find  the  work  done  by  the 
force  on  the  object. 

The  straightforward  way  to  do  this  involves  substituting  the  components  of  r  into  F, 
forming  the  dot  product  F  ■  r' ,  and  then  trying  to  compute  the  integral,  but  this  integral  is 


extraordinarily  messy,  perhaps  impossible  to  compute.  But  since  F  =  V(l/ yx"^  +  y'^  +  z"^) 
we  need  only  substitute: 

(2,1,-1) 


F  -dr  ^ 


c  a/x^  -f     +  2^ 


(1,0,0) 


□ 
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Another  immediate  consequence  of  the  Fundamental  Theorem  involves  closed  paths. 
A  path  C  is  closed  if  it  forms  a  loop,  so  that  traveling  over  the  C  curve  brings  you  back  to 
the  starting  point.  If  C  is  a  closed  path,  we  can  integrate  around  it  starting  at  any  point 
a;  since  the  starting  and  ending  points  are  the  same, 

/  V/  ■  rfr  =  /(a)  -  /(a)  =  0. 
Jc 

For  example,  in  a  gravitational  field  (an  inverse  square  law  field)  the  amount  of  work 
required  to  move  an  object  around  a  closed  path  is  zero.  Of  course,  it's  only  the  net 
amount  of  work  that  is  zero.  It  may  well  take  a  great  deal  of  work  to  get  from  point  a  to 
point  b,  but  then  the  return  trip  will  "produce"  work.  For  example,  it  takes  work  to  pump 
water  from  a  lower  to  a  higher  elevation,  but  if  you  then  let  gravity  pull  the  water  back 
down,  you  can  recover  work  by  running  a  water  wheel  or  generator.  (In  the  real  world  you 
won't  recover  all  the  work  because  of  various  losses  along  the  way.) 

To  make  use  of  the  Fundamental  Theorem  of  Line  Integrals,  we  need  to  be  able  to 
spot  conservative  vector  fields  F  and  to  compute  /  so  that  F  =  V/.  Suppose  that  F  = 
(P,  Q)  =  V/.  Then  P  =  and  Q  =  fy,  and  provided  that  /  is  sufficiently  nice,  we  know 
from  Clairaut's  Theorem  (14.6.2)  that  Py  =  fxy  =  fyx  =  Qx-  If  we  compute  Py  and  Qx 
and  find  that  they  are  not  equal,  then  F  is  not  conservative.  If  Py  =  Q^,  then,  again 
provided  that  F  is  sufficiently  nice,  we  can  be  assured  that  F  is  conservative.  Ultimately, 
what's  important  is  that  we  be  able  to  find  /;  as  this  amounts  to  finding  anti-derivatives, 
we  may  not  always  succeed. 

EXAMPLE  16.3.3     Find  an  /  so  that  (3  +  2xy,     -  3y^)  =  V/. 
First,  note  that 

d  d 
—  (3  +  2xy)=2x       and       —  (x^  —  3w^)  =  2x, 
oy  ox 

so  the  desired  /  does  exist.  This  means  that  /x  =  3  +  2xy,  so  that  /  =  3x  +  x^y  +  g{y); 
the  first  two  terms  are  needed  to  get  3  +  2xy,  and  the  g{y)  could  be  any  function  of  y,  as  it 
would  disappear  upon  taking  a  derivative  with  respect  to  x.  Likewise,  since  fy  =  x'^  —  3y^, 
f  =  x^y  —  y^  +  h{x).  The  question  now  becomes,  is  it  possible  to  find  g{y)  and  h{x)  so 
that 

3x  +  x'^y  +  g{y)  =  x'^y  —  y^  +  h{x), 

and  of  course  the  answer  is  yes:  g{y)  =  —y^,  h{x)  =  3x.  Thus,  f  =  3x  +  x'^y  —  y^.  □ 

We  can  test  a  vector  field  F  =  (P,  Q,  R)  in  a  similar  way.  Suppose  that  (P,  Q,  R)  = 
{fx,  fy,  fz) ■  If  we  temporarily  hold  z  constant,  then  f{x,y,z)  is  a  function  of  x  and  y. 
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and  by  Clairaut's  Theorem  Py  =  fj^y  =  fyj^  =  Qj^.  Likewise,  holding  y  constant  imphes 
Pz  =  fxz  =  fzx  =  Rxi  and  with  x  constant  we  get  =  fyz  =  fzy  =  Ry  Conversely,  if  we 
find  that  Py  =  Qx,  Pz  =  Rx,  and  Qz  =  Ry  then  F  is  conservative. 

Exercises  16.3. 

1.  Find  an  /  so  that  V/  =  {2x  +     ,  2y  +  x'^),  or  explain  why  there  is  no  such  /.  =^ 

2.  Find  an  /  so  that  V/  =  {x^,  — y^),  or  explain  why  there  is  no  such  /. 

3.  Find  an  /  so  that  V/  =  {xe^ ,  ye^),  or  explain  why  there  is  no  such  /. 

4.  Find  an  /  so  that  V/  =  (ycosx,ysinx),  or  explain  why  there  is  no  such  /. 

5.  Find  an  /  so  that  V/  =  (y  cos  a;,  sin  x),  or  explain  why  there  is  no  such  /. 

6.  Find  an  /  so  that  V/  =  {x'^y^,  xy^),  or  explain  why  there  is  no  such  /.  =^ 

7.  Find  an  f  so  that  Vf  =  (yz,  xz,  xy) ,  or  explain  why  there  is  no  such  f.  =^> 

Jc 

from  (0,0)  to  (3,2).  ^ 

9.  Let  F  =  {yz,  xz,  xy) .  Find  the  work  done  by  this  force  field  on  an  object  that  moves  from 
(1,0,2)  to  (1,2,3).  ^ 

10.  Let  F  =  {e^,xe^  +  sin  z,  y  cos  z).  Find  the  work  done  by  this  force  field  on  an  object  that 
moves  from  (0,0,0)  to  (1,-1,3).  =^> 

11.  Let 

F=  /  Zl  \ 

\  (X2  +  y2  +  ^2)3/2  '  (a;2  +  y2  +  ^2)3/2  '  (3,2  +  y2  +  ^^^3/2  J  ' 

Find  the  work  done  by  this  force  field  on  an  object  that  moves  from  (1, 1, 1)  to  (4,  5,  6). 

16.4    Green's  Theorem 

We  now  come  to  the  first  of  three  important  theorems  that  extend  the  Fundamental  The- 
orem of  Calculus  to  higher  dimensions.  (The  Fundamental  Theorem  of  Line  Integrals  has 
already  done  this  in  one  way,  but  in  that  case  we  were  still  dealing  with  an  essentially 
one- dimensional  integral.)  They  all  share  with  the  Fundamental  Theorem  the  following 
rather  vague  description:  To  compute  a  certain  sort  of  integral  over  a  region,  we  may  do 
a  computation  on  the  boundary  of  the  region  that  involves  one  fewer  integrations. 

Note  that  this  does  indeed  describe  the  Fundamental  Theorem  of  Calculus  and  the 
Fundamental  Theorem  of  Line  Integrals:  to  compute  a  single  integral  over  an  interval,  we 
do  a  computation  on  the  boundary  (the  endpoints)  that  involves  one  fewer  integrations, 
namely,  no  integrations  at  all. 
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THEOREM  16.4.1  Green's  Theorem  If  the  vector  field  F  =  (P,  Q)  and  the  region 
D  are  sufficiently  nice,  and  if  C  is  the  boundary  of  D  (C  is  a  closed  curve),  then 

dQ  dP 


D 


dA^      Pdx  +  Qdy, 
ox      dy  Jc 


provided  the  integration  on  the  right  is  done  counter-clockwise  around  C.  □ 

To  indicate  that  an  integral   /    is  being  done  over  a  closed  curve  in  the  counter- 

Jc 

clockwise  direction,  we  usually  write  (p  .    We  also  use  the  notation  dD  to  mean  the 


JC 

boundary  of  D  oriented  in  the  counterclockwise  direction.  With  this  notation, 

'C  JdD 

We  already  know  one  case,  not  particularly  interesting,  in  which  this  theorem  is  true: 

If  F  is  conservative,  we  know  that  the  integral  ^  F  ■  dr  =  0,  because  any  integral  of  a 

conservative  vector  field  around  a  closed  curve  is  zero.  We  also  know  in  this  case  that 
dP/dy  =  dQ/dx,  so  the  double  integral  in  the  theorem  is  simply  the  integral  of  the  zero 
function,  namely,  0.  So  in  the  case  that  F  is  conservative,  the  theorem  says  simply  that 
0  =  0. 

EXAMPLE  16.4.2    We  illustrate  the  theorem  by  computing  both  sides  of 

If...... 

D 


JdD 


where  D  is  the  triangular  region  with  corners  (0, 0),  (1,  0),  (0, 1). 
Starting  with  the  double  integral: 


II 


y  —  0  dA  =        /       ydydx=  dx  = 

^  0  Jo  Jo  6 


1 

6" 


D 

There  is  no  single  formula  to  describe  the  boundary  of  -D,  so  to  compute  the  left  side 
directly  we  need  to  compute  three  separate  integrals  corresponding  to  the  three  sides  of 
the  triangle,  and  each  of  these  integrals  we  break  into  two  integrals,  the  "(ia;"  part  and  the 
''dy"  part.  The  three  sides  are  described  by  y  =  0,  y  =  1  —  x,  and  x  —  The  integrals 
are  then 

•1  pO  pO  pi  rO  rO 


Jo 


X  dx  +  xydy=       x  dx  +      Ody+      x  dx  +  j   {l  —  y)ydy+  /   Odx+  j  Ody 
dD  Jo  Jo  Ji 

111  1 

Alternately,  we  could  describe  the  three  sides  in  vector  form  as  (t,  0),  (1  —  t,t),  and 
(0, 1  —  t).  Note  that  in  each  case,  as  t  ranges  from  0  to  1,  we  follow  the  corresponding  side 
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in  the  correct  direction.  Now 

/    x'^  dx  +  xy  dy  =  f  t'^  +  t-Odt+  f  -{1  -  t)^  +  {1  -  t)tdt  +  [  0  +  Odt 

JdD  Jo  Jo  Jo 

=   f   t^dt+  f  -{l-tf+{l-t)tdt=l. 

Jo  Jo  6 


□ 


In  this  case,  none  of  the  integrations  are  difficult,  but  the  second  approach  is  some- 
what tedious  because  of  the  necessity  to  set  up  three  different  integrals.  In  different 
circumstances,  either  of  the  integrals,  the  single  or  the  double,  might  be  easier  to  compute. 
Sometimes  it  is  worthwhile  to  turn  a  single  integral  into  the  corresponding  double  integral, 
sometimes  exactly  the  opposite  approach  is  best. 

Here  is  a  clever  use  of  Green's  Theorem:  We  know  that  areas  can  be  computed  using 
double  integrals,  namely, 

D 

computes  the  area  of  region  D.  If  we  can  find  P  and  Q  so  that  dQ/dx  —  dP/dy  =  1,  then 
the  area  is  also 

/    Pdx  +  Qdy. 

JdD 

It  is  quite  easy  to  do  this:  P  =  0,Q  =  x  works,  as  do  P  =  — y, Q  =  0  and  P  =  —y/2,  Q  = 
x/2. 


EXAMPLE  16.4.3  An  ellipse  centered  at  the  origin,  with  its  two  principal  axes  aligned 
with  the  X  and  y  axes,  is  given  by 

We  find  the  area  of  the  interior  of  the  ellipse  via  Green's  theorem.  To  do  this  we  need  a 
vector  equation  for  the  boundary;  one  such  equation  is  (a  cos  t,  h  sin  t) ,  as  t  ranges  from  0 
to  27r.  We  can  easily  verify  this  by  substitution: 

x"^     y^         cos^  t     6^  sin^  t         o  -2 

^  +  7^  =   o  1  To —  =  cos  t  +  sm  t  =  l. 

Let's  consider  the  three  possibilities  for  P  and  Q  above:  Using  0  and  x  gives 

/»27r  /*27r 

Odx  +  xdy=  /     a  cos{t)b cos{t)  dt  —  /  abcos^{t)dt. 


c  Jo  Jo 
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Using  —y  and  0  gives 

(p  —ydx  +  Ody=  /  —bsiia.{t){—asm{t))dt=  /  absiia^{t)dt. 
Jc  Jo  Jo 

Finally,  using  —y/2  and  x/2  gives 


/  V  ,  X  ,  f'^'^  6sin(t)  ,  .  ,  XX  ,  ocos(t) 
j        dx  +  -dy=  ^(_asin(0)(it+— ^ 


{bcos{t))dt 


''^''absia^t     abcos'^t,       f'^'' ab  , 

-  dt  =  nab. 


r'' absmH     abcosH  ,  T 


The  first  two  integrals  are  not  particularly  difficult,  but  the  third  is  very  easy,  though  the 
choice  of  P  and  Q  seems  more  complicated.  □ 


Proof  of  Green's  Theorem.  We  cannot  here  prove  Green's  Theorem  in  general,  but 
we  can  do  a  special  case.  We  seek  to  prove  that 


f  Pdx  +  Qdy  =  ll^-^dA. 

^  D 


It  is  sufficient  to  show  that 


j  Pdx  =  jj -^dA       and       j  Qdy^  jj  ^dA, 

D  ^  D 


which  we  can  do  if  we  can  compute  the  double  integral  in  both  possible  ways,  that  is,  using 
dA  =  dy  dx  and  dA  =  dx  dy. 
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For  the  first  equation,  we  start  with 

'a    J  gi{x) 


D 


dy 


r-b     r-g2{x)  QP  nh 

dA  =  /    /        ——dydx=  /   P{x,g2{x))  —  P{x,gi{x))  dx. 
J  a  Jq,(x)  oy 


Here  we  have  simply  used  the  ordinary  Fundamental  Theorem  of  Calculus,  since  for  the 
inner  integral  we  are  integrating  a  derivative  with  respect  to  y:  an  antiderivative  of  dP/dy 
with  respect  to  y  is  simply         y),  and  then  we  substitute  gi  and  (72  for  y  and  subtract. 

Now  we  need  to  manipulate  §^Pdx.  The  boundary  of  region  D  consists  of  4  parts, 
given  by  the  equations  y  =  gi{x),  x  =  b,  y  =  g2{x),  and  x  =  a.  On  the  portions  x  =  b 
and  X  =  a,  dx  =  0  dt,  so  the  corresponding  integrals  are  zero.  For  the  other  two  portions, 
we  use  the  parametric  forms  x  =  t,  y  =  gi{t),  a  <  t  <  b,  and  x  =  t,  y  =  g2{t),  letting  t 
range  from  b  to  a,  since  we  are  integrating  counter-clockwise  around  the  boundary.  The 
resulting  integrals  give  us 


/"  fb  pa  />&  pb 

f  Pdx=      P{t,g,{t))dt+       P{t,g2{t))dt=       P{t,gi{t))dt-  P{t,g2{t)) 

J  C  J  a  J  b  J  a  J  a 

P{t,gi{t))-P{t,g2{t))dt 


dt 


which  is  the  result  of  the  double  integral  times  —1,  as  desired. 

The  equation  involving  Q  is  essentially  the  same,  and  left  as  an  exercise. 

Exercises  16.4- 

1.  Compute  /     2ydx  +  3xdy,  where  D  is  described  byO<a;<l,  0<y<l.  => 

JdD 


2.  Compute  /     xy  dx  +  xydy,  where  D  is  described  byO<a;<l,  0<y<l.  =^ 

JdD 

3.  Compute  /     g'^^+^y  (j^x  +  e^^  dy,  where  D  is  described  by  —2  <  x  <  2,  —1  <  y  <  1.  = 

JdD 

4.  Compute  /     ycosxdx  +  ysinxdy,  where  D  is  described  by  0  <  a;  <  n/2,  1  <  y  <  2. 

JdD 

5.  Compute  /     x^ydx  +  xy^  dy,  where  D  is  described  byO<x<l,  0<j/<x.  =^> 

JdD 

6.  Compute  /     x^/y  dx  +  \/x  +  y  dy,  where  D  is  described  by  1  <  a;  <  2,  23;  <  y  <  4.  = 

JdD 

7.  Compute  /   (./„)  dx  +  (2  +  3.)  <ly,  where  D  is  described  by  1  <  .  <  2,  1  <  !,  <  x^ 

JdD 

8.  Compute  /     sinydx  +  sina;dy,  where  D  is  described  by  0  <  x  <  7r/2,  x  <  y  <  7r/2. 

JdD 

9.  Compute  /    xlnydx,  where  D  is  described  by  1  <  a;  <  2,      <  y  <      .  =J> 

JdD 
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10.  Compute  /     \/l  +     dy,  where  D  is  described  by  —  1  <  x  <  1,      <  y  <  1. 

JdD 

11.  Compute  /     x^ydx  —  xy^  dy,  where  D  is  described  by  x'^  +  y^  <  1.  =^ 

JdD 


12.  Compute  /     y  dx  +  2x  dy,  where  D  is  described  by  x  +  y   <  4.  =^ 

JdD 

13.  Evaluate  (j)  {y  —  sin(x))  dx  +  cos(x)  dy,  where  C  is  the  boundary  of  the  triangle  with  vertices 
(0,0),  (1,0),  and  (1,2)  oriented  counter-clockwise.  =^> 

14.  Finish  our  proof  of  Green's  Theorem  by  showing  that  (f  Q  dy  =  f  f  dA. 


c 


J  J  dx 


D 


16.5    Divergence  and  Curl 

Divergence  and  curl  are  two  measurements  of  vector  fields  that  are  very  useful  in  a  variety  of 
applications.  Both  are  most  easily  understood  by  thinking  of  the  vector  field  as  representing 
a  flow  of  a  liquid  or  gas;  that  is,  each  vector  in  the  vector  field  should  be  interpreted  as  a 
velocity  vector.  Roughly  speaking,  divergence  measures  the  tendency  of  the  fluid  to  collect 
or  disperse  at  a  point,  and  curl  measures  the  tendency  of  the  fluid  to  swirl  around  the  point. 
Divergence  is  a  scalar,  that  is,  a  single  number,  while  curl  is  itself  a  vector.  The  magnitude 
of  the  curl  measures  how  much  the  fluid  is  swirling,  the  direction  indicates  the  axis  around 
which  it  tends  to  swirl.  These  ideas  are  somewhat  subtle  in  practice,  and  are  beyond 
the  scope  of  this  course.  You  can  find  additional  information  on  the  web,  for  example  at 
http :  //mathinsight .  org/ curl_idea  and  http :  //mathinsight .  org/divergence_idea 
and  in  many  books  including  Div,  Grad,  Curl,  and  All  That:  An  Informal  Text  on  Vector 
Calculus,  by  H.  M.  Schey. 

Recall  that  if  /  is  a  function,  the  gradient  of  /  is  given  by 

V/  =  /^  ^  ^ 

\dx'  dy'  dz 

A  useful  mnemonic  for  this  (and  for  the  divergence  and  curl,  as  it  turns  out)  is  to  let 

d    d  d 


\dx'  dy'  dz 

that  is,  we  pretend  that  V  is  a  vector  with  rather  odd  looking  entries.  Recalling  that 
{u,  V,  w)a  —  {ua,  va,  wa),  we  can  then  think  of  the  gradient  as 


dx  dy  dz  /        \  dx  dy  dz 
that  is,  we  simply  multiply  the  /  into  the  vector. 
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The  divergence  and  curl  can  now  be  defined  in  terms  of  tliis  same  odd  vector  V  by 
using  the  cross  product  and  dot  product.  The  divergence  of  a  vector  field  F  =  (/,  g,  h)  is 

d    d  d 


V-F 


dx^  dy^  dz  /      '  '         dx     dy  dz 


The  curl  of  F  is 

V  X  F  = 

Here  are  two  simple  but  useful  facts  about  divergence  and  curl 


i 

j 

k 

d_ 

d_ 

d_ 

dx 

dy 

dz 

f 

9 

h 

dh     dg  df     dh  dg  df 

dy     dz '  dz     dx'  dx  dy 


THEOREM  16.5.1     V   (V  x  F)  =  0.  ■ 

In  words,  this  says  that  the  divergence  of  the  curl  is  zero. 

THEOREM  16.5.2    V  x  (V/)  =  0.  ■ 

That  is,  the  curl  of  a  gradient  is  the  zero  vector.  Recalling  that  gradients  are  conser- 
vative vector  fields,  this  says  that  the  curl  of  a  conservative  vector  field  is  the  zero  vector. 
Under  suitable  conditions,  it  is  also  true  that  if  the  curl  of  F  is  0  then  F  is  conservative. 
(Note  that  this  is  exactly  the  same  test  that  we  discussed  on  page  425.) 

EXAMPLE  16.5.3  Let  F  =  (e^  1,  xe^).  Then  V  x  F  =  (0,  -  e^  0)  =  0.  Thus,  F  is 
conservative,  and  we  can  exhibit  this  directly  by  finding  the  corresponding  /. 

Since  /x  =  e^,  /  =  xe^  +  g{y,  z).  Since  fy  =  1,  it  must  be  that  gy  =  1,  so  g{y,  z)  = 
y  +  h{z).  Thus  /  =  xe^  +  y  +  h{z)  and 

xe^  =  f^xe'^  +  0  +  h'{z), 

so  h'{z)  =  0,  i.e.,  h{z)  =  C,  and  /  =  xe^  +  y  +  C.  □ 

We  can  rewrite  Green's  Theorem  using  these  new  ideas;  these  rewritten  versions  in 
turn  are  closer  to  some  later  theorems  we  will  see. 

Suppose  we  write  a  two  dimensional  vector  field  in  the  form  F  =  (P,  Q,  0),  where  P 
and  Q  are  functions  of  x  and  y.  Then 


V  X  F 


i      j  k 

d  d  d 
dx      dy  dz 


ax      ay  Oz 

P     Q  0 

and  so  (V  X  F)  ■  k  =  (0,  0,      -  Py)  ■  (0,  0, 1)  =  Qx  -  Py  So  Green's  Theorem  says 


F  •  dr 


dD 


dD 


Pdx  +  Qdy  =  JjQ^-PydA  =  JJ  {V  xF)  -  kdA.  (16.5.1) 

D  D 
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Roughly  speaking,  the  right-most  integral  adds  up  the  curl  (tendency  to  swirl)  at  each 
point  in  the  region;  the  left-most  integral  adds  up  the  tangential  components  of  the  vector 
field  around  the  entire  boundary.  Green's  Theorem  says  these  are  equal,  or  roughly,  that 
the  sum  of  the  "microscopic"  swirls  over  the  region  is  the  same  as  the  "macroscopic"  swirl 
around  the  boundary. 

Next,  suppose  that  the  boundary  dD  has  a  vector  form  r(i),  so  that  r'(t)  is  tangent  to 
the  boundary,  and  T  =  r'(t)/|r'(t)  |  is  the  usual  unit  tangent  vector.  Writing  r  =  {x(t),y{t)) 
we  get 

\r'{t)\ 

and  then 

\r'it)\ 

is  a  unit  vector  perpendicular  to  T,  that  is,  a  unit  normal  to  the  boundary.  Now 
/    F-Nds=  [   (P,Q)  ■  %-^|r'(t)|dt  =  /  Py'dt-Qx'dt 

JdD  JdD  \^  JdD 


P  dy  —  Q  dx  =  /    —Qdx  +  Pdy. 

dD  JdD 

So  far,  we've  just  rewritten  the  original  integral  using  alternate  notation.  The  last  integral 
looks  just  like  the  left  side  of  Green's  Theorem  (16.4.1)  except  that  P  and  Q  have  traded 
places  and  Q  has  acquired  a  negative  sign.  Then  applying  Green's  Theorem  we  get 

-Qdx  +  Pdy  =  jj  P^  +  QydA  =  jj  V  -YdA. 

D  D 

Summarizing  the  long  string  of  equalities, 

F -Nds  =  1 1  V -FdA.  (16.5.2) 

dD  J  J 

D 

Roughly  speaking,  the  first  integral  adds  up  the  flow  across  the  boundary  of  the  region, 
from  inside  to  out,  and  the  second  sums  the  divergence  (tendency  to  spread)  at  each  point 
in  the  interior.  The  theorem  roughly  says  that  the  sum  of  the  "microscopic"  spreads  is  the 
same  as  the  total  spread  across  the  boundary  and  out  of  the  region. 
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Exercises  16.5. 


1.  Let  F  =  {xy,  —xy)  and  let  D  be  given  by  0  <  x  <  1,  0  <  y  <  1.  Compute  /     Y  ■  dr  and 

J  dD 


F  •  Nds. 

dD 


2.  Let  F  =  {ax^,by^)  and  let  D  be  given  by  0  <  a;  <  1,  0  <  y  <  1.  Compute  /     F  •  dr  and 

JdD 

L 


F-Nds. 

dD 


3.  Let  F  =  {ay^  ,hx^)  and  let  D  be  given  by  0  <  x  <  1,  0  <  y  <  a;.  Compute  /     F  •  dr  and 

J  dD 

[    F-Nds.  ^ 

JdD 

4.  Let  F  =  (sin a;  cos y,  cos  X  sin  y)  and  let  D  be  given  by  0  <  x  <  tv/2,  0  <  y  <  x.  Compute 

/    F  •  dr  and  /    F  •  N  ds.  ^ 

JdD  JdD 

5.  Let  F  =  (y,  — x)  and  let  D  be  given  by  x^  +  y^  <  1.  Compute  /     F  •  dr  and  /  F-Nds. 

_^  JdD  JdD 

6.  Let  F  =  (x,  y)  and  let  D  be  given  by  x^  +  y'^  <  1.  Compute  /     F  •  dr  and  /     F  •  Nds.  =^ 

JdD  JdD 

7.  Prove  theorem  16.5.1. 

8.  Prove  theorem  16.5.2. 

9.  If  V  •  -F  =  0,  F  is  said  to  be  incompressible.   Show  that  any  vector  field  of  the  form 
F{x,y,z)  =  {f{y,z),g{x,z),h{x,y))  is  incompressible.  Give  a  non-trivial  example. 

16.6    Vector  Functions  for  Surfaces 

We  have  dealt  extensively  with  vector  equations  for  curves,  r{t)  =  {x{t),y(t),  z{t)).  A 
similar  technique  can  be  used  to  represent  surfaces  in  a  way  that  is  more  general  than  the 
equations  for  surfaces  we  have  used  so  far.  Recall  that  when  we  use  r{t)  to  represent  a 
curve,  we  imagine  the  vector  r(t)  with  its  tail  at  the  origin,  and  then  we  follow  the  head 
of  the  arrow  as  t  changes.  The  vector  "draws"  the  curve  through  space  as  t  varies. 
Suppose  we  instead  have  a  vector  function  of  two  variables, 

r{u,v)  =  {x{u,v),y{u,v),z{u,v)). 

As  both  u  and  v  vary,  we  again  imagine  the  vector  r(w,  v)  with  its  tail  at  the  origin,  and 
its  head  sweeps  out  a  surface  in  space.  A  useful  analogy  is  the  technology  of  CRT  video 
screens,  in  which  an  electron  gun  fires  electrons  in  the  direction  of  the  screen.  The  gun's 
direction  sweeps  horizontally  and  vertically  to  "paint"  the  screen  with  the  desired  image. 
In  practice,  the  gun  moves  horizontally  through  an  entire  line,  then  moves  vertically  to  the 
next  line  and  repeats  the  operation.  In  the  same  way,  it  can  be  useful  to  imagine  fixing  a 
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value  of  V  and  letting  y{u,  v)  sweep  out  a  curve  as  u  changes.  Then  v  can  change  a  bit,  and 
y{u,v)  sweeps  out  a  new  curve  very  close  to  the  first.  Put  enough  of  these  curves  together 
and  they  form  a  surface. 

EXAMPLE  16.6.1  Consider  the  function  r(u,  t;)  =  (f  cos  w,  f  sinu,  f ).  For  a  fixed  value 
of  f ,  as  u  varies  from  0  to  27r,  this  traces  a  circle  of  radius  v  at  height  v  above  the  x-y 
plane.  Put  lots  and  lots  of  these  together, and  they  form  a  cone,  as  in  figure  16.6.1.  □ 


Figure  16.6.1     Tracing  a  surface. 

EXAMPLE  16.6.2  Let  r  =  (f  cos  m,  f  sin  w,  w).  If  v  is  constant,  the  resulting  curve  is  a 
helix  (as  in  figure  13.1.1).  If  u  is  constant,  the  resulting  curve  is  a  straight  line  at  height 
u  in  the  direction  u  radians  from  the  positive  x  axis.  Note  in  figure  16.6.2  how  the  helixes 
and  the  lines  both  paint  the  same  surface  in  a  different  way.  □ 
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This  technique  aUows  us  to  represent  many  more  surfaces  than  previously. 

EXAMPLE  16.6.3     The  curve  given  by 

r  =  ((2  +  cos(3w,/2))  cos  u,  (2  +  cos(3w/2))  sinw,  sin(3it/2)) 

is  caUed  a  trefoil  knot.  Recall  that  from  the  vector  equation  of  the  curve  we  can  compute 
the  unit  tangent  T,  the  unit  normal  N,  and  the  binormal  vector  B  =  T  x  N;  you  may 
want  to  review  section  13.3.  The  binormal  is  perpendicular  to  both  T  and  N;  one  way  to 
interpret  this  is  that  N  and  B  define  a  plane  perpendicular  to  T,  that  is,  perpendicular  to 
the  curve;  since  N  and  B  are  perpendicular  to  each  other,  they  can  function  just  as  i  and 
j  do  for  the  x-y  plane.  So,  for  example,  c{v)  =  Ncosf  +  Bsinf  is  a  vector  equation  for 
a  unit  circle  in  a  plane  perpendicular  to  the  curve  described  by  r,  except  that  the  usual 
interpretation  of  c  would  put  its  center  at  the  origin.  We  can  fix  that  simply  by  adding  c 
to  the  original  r:  let  f  =  r{u)  +  c{v).  For  a  fixed  u  this  draws  a  circle  around  the  point 
r(w);  as  u  varies  we  get  a  sequence  of  such  circles  around  the  curve  r,  that  is,  a  tube  of 
radius  1  with  r  at  its  center.  We  can  easily  change  the  radius;  for  example  r{u)  +  ac{v) 
gives  the  tube  radius  a;  we  can  make  the  radius  vary  as  we  move  along  the  curve  with 
r{u)  +  g{u)c{v),  where  g{u)  is  a  function  of  u.  As  shown  in  figure  16.6.3,  it  is  hard  to  see 
that  the  plain  knot  is  knotted;  the  tube  makes  the  structure  apparent.  Of  course,  there  is 
nothing  special  about  the  trefoil  knot  in  this  example;  we  can  put  a  tube  around  (almost) 
any  curve  in  the  same  way.  □ 


Figure  16.6.3     Tubes  around  a  trefoil  knot,  with  radius  1/2  and  3cos(u)/4. 

We  have  previously  examined  surfaces  given  in  the  form  f{x,y).  It  is  sometimes 
useful  to  represent  such  surfaces  in  the  more  general  vector  form,  which  is  quite  easy: 
r{u,v)  =  (w,  f , /(w,  f )).  The  names  of  the  variables  are  not  important  of  course;  instead 
of  disguising  x  and  y,  we  could  simply  write  r(x,  y)  =  (x,  y,  /(x,  y)). 
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We  have  also  previously  dealt  with  surfaces  that  are  not  functions  of  x  and  y;  many 
of  these  are  easy  to  represent  in  vector  form.  One  common  type  of  surface  that  cannot  be 
represented  as  ^  =  f{x,y)  is  a  surface  given  by  an  equation  involving  only  x  and  y.  For 
example,  x  +  y  =  1  and  y  =  x"^  are  "vertical"  surfaces.  For  every  point  (x,  y)  in  the  plane 
that  satisfies  the  equation,  the  point  (x,  y,  z)  is  on  the  surface,  for  every  value  of  z.  Thus, 
a  corresponding  vector  form  for  the  surface  is  something  like  {f{u),g{u),v);  for  example, 
X  +  y  =  1  becomes  (m,  1  —  u,v)  and  y  =  x'^  becomes  (u,  u^,v). 

Yet  another  sort  of  example  is  the  sphere,  say  x^  +  y^  +  z^  =  1.  This  cannot  be  written 
in  the  form  z  =  f{x,y),  but  it  is  easy  to  write  in  vector  form;  indeed  this  particular 
surface  is  much  like  the  cone,  since  it  has  circular  cross-sections,  or  we  can  think  of  it  as 
a  tube  around  a  portion  of  the  2;-axis,  with  a  radius  that  varies  depending  on  where  along 
the  axis  we  are.  One  vector  expression  for  the  sphere  is  (Vl  —  f  ^  cos  m,  a/1  —  sinw,  v) — 
this  emphasizes  the  tube  structure,  as  it  is  naturally  viewed  as  drawing  a  circle  of  radius 
Vl  —  f  ^  around  the  2;-axis  at  height  v.  We  could  also  take  a  cue  from  spherical  coordinates, 
and  write  (sin  w  cos  f,  sin  w  sin  f,  cosw),  where  in  effect  u  and  v  are  4>  and  9  in  disguise. 

It  is  quite  simple  in  Sage  to  plot  any  surface  for  which  you  have  a  vector  representation. 
Using  different  vector  functions  sometimes  gives  different  looking  plots,  because  Sage  in 
effect  draws  the  surface  by  holding  one  variable  constant  and  then  the  other.  For  example, 
you  might  have  noticed  in  figure  16.6.2  that  the  curves  in  the  two  right-hand  graphs  are 
superimposed  on  the  left-hand  graph;  the  graph  of  the  surface  is  just  the  combination  of 
the  two  sets  of  curves,  with  the  spaces  filled  in  with  color. 

Here's  a  simple  but  striking  example:  the  plane  x  +  y  +  z  =  1  can  be  represented 
quite  naturally  as  {u,v,l  —  u  —  v).  But  we  could  also  think  of  painting  the  same  plane  by 
choosing  a  particular  point  on  the  plane,  say  (1,0,0),  and  then  drawing  circles  or  ellipses 
(or  any  of  a  number  of  other  curves)  as  if  that  point  were  the  origin  in  the  plane.  For 
example,  {1  —  v  cos  u  —  v  sin  u,  v  sin  u,  v  cos  u)  is  one  such  vector  function.  Note  that  while 
it  may  not  be  obvious  where  this  came  from,  it  is  quite  easy  to  see  that  the  sum  of  the 
X,  y,  and  z  components  of  the  vector  is  always  1.  Computer  renderings  of  the  plane  using 
these  two  functions  are  shown  in  figure  16.6.4. 

Suppose  we  know  that  a  plane  contains  a  particular  point  (xq,  yQ^zo)  and  that  two 
vectors  u  =  (wq,  "Ui,  ^2)  and  v  =  {vq,  fi,  V2)  are  parallel  to  the  plane  but  not  to  each  other. 
We  know  how  to  get  an  equation  for  the  plane  in  the  form  ax  +  by  +  cz  =  d,  by  first 
computing  u  x  v.  It's  even  easier  to  get  a  vector  equation: 

r(u,  v)  =  (xo,  2/0,  ^0)  +UU  +  vv. 

The  first  vector  gets  to  the  point  (xq,  yo,  zq)  and  then  by  varying  u  and  v,  uu  +  vv  gets  to 
every  point  in  the  plane. 

Returning  to  x  +  y  +  z  =  1,  the  points  (1,0,0),  (0, 1,0),  and  (0,0, 1)  are  all  on  the 
plane.  By  subtracting  coordinates  we  see  that  (—1,0, 1)  and  (—1, 1,0)  are  parallel  to  the 
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1 


Figure  16.6.4     Two  representations  of  the  same  plane, 
plane,  so  a  third  vector  form  for  this  plane  is 

(1,0,0)  0,1)  +  1,0)  =  {l-u-v,v,u). 

This  is  clearly  quite  similar  to  the  first  form  we  found. 

We  have  already  seen  (section  15.4)  how  to  find  the  area  of  a  surface  when  it  is  defined 
in  the  form  /(x,  y).  Finding  the  area  when  the  surface  is  given  as  a  vector  function  is  very 
similar.  Looking  at  the  plots  of  surfaces  we  have  just  seen,  it  is  evident  that  the  two  sets 
of  curves  that  fill  out  the  surface  divide  it  into  a  grid,  and  that  the  spaces  in  the  grid  are 
approximately  parallelograms.  As  before  this  is  the  key:  we  can  write  down  the  area  of  a 
typical  little  parallelogram  and  add  them  all  up  with  an  integral. 

Suppose  we  want  to  approximate  the  area  of  the  surface  y{u,v)  near  r(wo,fo)-  The 
functions  y{u,vq)  and  y{uq,v)  define  two  curves  that  intersect  at  y{uq,vq).  The  deriva- 
tives of  r  give  us  vectors  tangent  to  these  two  curves:  ru(uo,fo)  and  y^{uq,vq),  and  then 
Yu{uq^vq)  du  and  Y^iuQ^vo)  dv  are  two  small  tangent  vectors,  whose  lengths  can  be  used 
as  the  lengths  of  the  sides  of  an  approximating  parallelogram.  Finally,  the  area  of  this 
parallelogram  is  |r^j  x  r„|  dudv  and  so  the  total  surface  area  is 


X  r„|  dudv. 


EXAMPLE  16.6.4  We  find  the  area  of  the  surface  (f  cos  w,  f  sin  w,  w)  for  0  <  "u  <  tt 
and  0  <      <  1;  this  is  a  portion  of  the  helical  surface  in  figure  16.6.2.   We  compute 
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Tu  =  (— f  siiiM,  f  COS  M,  1)  and  r„  =  (cos  u,  sinu,  0).  The  cross  product  of  these  two  vectors 


Exercises  16.6. 

1.  Describe  or  sketch  the  surface  with  the  given  vector  function. 

a.  y{u,  v)  =  {u  +  v,3  —  v,l-\-4u  +  5v) 

b.  r{u,v)  =  {2smu,3cosu,v) 

c.  r(s,t)  =  {s,t,t^  -  s^) 

d.  r(s,  t)  =  (s  sin  2t,  s'^,  s  cos  2t) 

2.  Find  a  parametric  representation,  r{u,v),  for  the  surface. 

a.  The  plane  that  passes  through  the  point  (1,  2,  —3)  and  is  paraUel  to  the  vectors  (1, 1,  — 1) 
and  (1,-1,1). 

b.  The  lower  hah  of  the  ellipsoid  2x^  +       +      =  1. 

c.  The  part  of  the  sphere  of  radius  4  centered  at  the  origin  that  lies  between  the  planes 
z  =  —2  and  z  =  2. 

3.  Find  the  area  of  the  portion  of  x  +  2y  +  42;  =  10  in  the  first  octant.  =^> 

4.  Find  the  area  of  the  portion  of  2x  +  4y  +  2;  =  0  inside      +     =  1.  =^> 

5.  Find  the  area  oi  z  =  x'^  +      that  lies  below  z  =  1.  =^> 

6.  Find  the  area  of  z  =     x^  +  y'^  that  lies  below  z  =  2.  =^> 

7.  Find  the  area  of  the  portion  of      +  j/^  +  z^  =      that  lies  in  the  first  octant.  =^ 

8.  Find  the  area  of  the  portion  ofx  +  y  +  z  =  a  that  lies  above  x  +  y  <  6  .  =^ 

9.  Find  the  area     z  =  x"^  —  y^  that  lies  inside  x"^  +  y"^  =  a? .  =^ 

10.  Find  the  area  oi  z  =  xy  that  lies  inside  x'^  +  y"^  =  a? .  =^ 

11.  Find  the  area  of  a;'^  +      +  z^  =  a?  that  lies  above  the  interior  of  the  circle  given  in  polar 
coordinates  by  r  =  a  cos  6.  =^ 

12.  Find  the  area  of  the  cone  z  =  k^^ x'^  +  y'^  that  lies  above  the  interior  of  the  circle  given  in 
polar  coordinates  by  r  =  a  cos  6.  =^> 

13.  Find  the  area  of  the  plane  z  =  ax  +  by  +  c  that  lies  over  a  region  D  with  area  A.  =^ 

14.  Find  the  area  of  the  cone  z  =  A;Y^x^~+~y^  that  lies  over  a  region  D  with  area  A.  =^ 

15.  Find  the  area  of  the  cylinder  x  +  z   =  a   that  lies  inside  the  cylinder  x  +  y   =  a  .  =^ 

16.  The  surface  f{x,y)  can  be  represented  with  the  vector  function  {x,y,  f{x,y)).  Set  up  the 
surface  area  integral  using  this  vector  function  and  compare  to  the  integral  of  section  15.4. 


y/l  +  v"^,  and  the  surface  area  is 


□ 
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16.7    Surface  Integrals 

In  the  integral  for  surface  area, 


X  r^l  dudv, 


the  integrand  {r^  x  r^,|  dudv  is  the  area  of  a  tiny  parallelogram,  that  is,  a  very  small  surface 
area,  so  it  is  reasonable  to  abbreviate  it  dS;  then  a  shortened  version  of  the  integral  is 

D 

We  have  already  seen  that  if  D  is  a  region  in  the  plane,  the  area  of  D  may  be  computed 
with 

D 

so  this  is  really  quite  familiar,  but  the  dS  hides  a  little  more  detail  than  does  dA. 
Just  as  we  can  integrate  functions  f{x,  y)  over  regions  in  the  plane,  using 

JJ  f{x,y)dA, 

D 

SO  we  can  compute  integrals  over  surfaces  in  space,  using 

J  J  f{x,y,z)dS. 

D 

In  practice  this  means  that  we  have  a  vector  function  r(w,  v)  =  {x{u,  v),  y{u,  v),  z{u,  v))  for 
the  surface,  and  the  integral  we  compute  is 


f{x{u,v),y{u,v),z{u,v))\ru  x  r^l  dudv. 


That  is,  we  express  everything  in  terms  of  u  and  v,  and  then  we  can  do  an  ordinary  double 
integral. 

EXAMPLE  16.7.1  Suppose  a  thin  object  occupies  the  upper  hemisphere  of  +  + 
z'^  =  1  and  has  density  a{x,y,z)  =  z.  Find  the  mass  and  center  of  mass  of  the  object. 
(Note  that  the  object  is  just  a  thin  shell;  it  does  not  occupy  the  interior  of  the  hemisphere.) 
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We  write  the  hemisphere  as  r{(f),$)  =  (cos  ^  sin  0,  sin  ^  sin  0,  cos  0) ,  0  <  0  <  7r/2  and 
0  <  ^  <  27r.  So  To  =  (— sin  ^  sin  0,  cos  ^  sin  0,0)  and  r<^  =  (cos  ^  cos  0,  sin  ^  cos  0,  —  sin  0) . 
Then 

r^i  X  r<^  =  (  —  cos  9  sin^  0,  —  sin  9  sin^  0,  —  cos  0  sin  0) 

and 

|r6i  X  r^l  =  I  sin 01  =  sin0, 

since  we  are  interested  only  inO<0<7r/2.  Finally,  the  density  is  ^  =  cos0  and  the 
integral  for  mass  is 

f27r  p7r/2 

COS  0  sin  (f)d(f)d9  =  tt. 


io  io 


By  symmetry,  the  center  of  mass  is  clearly  on  the  2-axis,  so  we  only  need  to  find  the 
^-coordinate  of  the  center  of  mass.  The  moment  around  the  x-y  plane  is 

27r     /'7r/2  /»27r    pi^/l  27r 

/      z  COS  (f)  sin  (f)  d(f)  d9  =  /      /      cos^  (f)  sin  (f)  d(f)  d9  =  —, 
Jo  Jo    Jo  3 

so  the  center  of  mass  is  at  (0, 0, 2/3).  □ 

Now  suppose  that  F  is  a  vector  field;  imagine  that  it  represents  the  velocity  of  some 
fiuid  at  each  point  in  space.  We  would  like  to  measure  how  much  fiuid  is  passing  through 
a  surface  D,  the  flux  across  D.  As  usual,  we  imagine  computing  the  fiux  across  a  very 
small  section  of  the  surface,  with  area  dS,  and  then  adding  up  all  such  small  fluxes  over  D 
with  an  integral.  Suppose  that  vector  N  is  a  unit  normal  to  the  surface  at  a  point;  F  •  N 
is  the  scalar  projection  of  F  onto  the  direction  of  N,  so  it  measures  how  fast  the  fluid  is 
moving  across  the  surface.  In  one  unit  of  time  the  fluid  moving  across  the  surface  will  flU  a 
volume  of  F  ■  N  dS,  which  is  therefore  the  rate  at  which  the  fluid  is  moving  across  a  small 
patch  of  the  surface.  Thus,  the  total  flux  across  D  is 


F-NdS  =  JJ  F  -  dS, 

D  D 

deflning  dS  =  N  dS.  As  usual,  certain  conditions  must  be  met  for  this  to  work  out;  chief 
among  them  is  the  nature  of  the  surface.  As  we  integrate  over  the  surface,  we  must  choose 
the  normal  vectors  N  in  such  a  way  that  they  point  "the  same  way"  through  the  surface. 
For  example,  if  the  surface  is  roughly  horizontal  in  orientation,  we  might  want  to  measure 
the  flux  in  the  "upwards"  direction,  or  if  the  surface  is  closed,  like  a  sphere,  we  might  want 
to  measure  the  flux  "outwards"  across  the  surface.  In  the  flrst  case  we  would  choose  N  to 
have  positive  z  component,  in  the  second  we  would  make  sure  that  N  points  away  from  the 
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origin.  Unfortunately,  there  are  surfaces  that  are  not  orientable:  they  have  only  one  side, 
so  that  it  is  not  possible  to  choose  the  normal  vectors  to  point  in  the  "same  way"  through 
the  surface.  The  most  famous  such  surface  is  the  Mobius  strip  shown  in  figure  16.7.1.  It 
is  quite  easy  to  make  such  a  strip  with  a  piece  of  paper  and  some  tape.  If  you  have  never 
done  this,  it  is  quite  instructive;  in  particular,  you  should  draw  a  line  down  the  center  of 
the  strip  until  you  return  to  your  starting  point.  No  matter  how  unit  normal  vectors  are 
assigned  to  the  points  of  the  Mobius  strip,  there  will  be  normal  vectors  very  close  to  each 
other  pointing  in  opposite  directions. 


Assuming  that  the  quantities  involved  are  well  behaved,  however,  the  flux  of  the  vector 
field  across  the  surface  r{u,v)  is 


In  practice,  we  may  have  to  use  r„  x  or  even  something  a  bit  more  complicated  to  make 
sure  that  the  normal  vector  points  in  the  desired  direction. 

EXAMPLE  16.7.2  Compute  the  flux  of  F  =  {x,  y,  z^)  across  the  cone  z  =  \J +  , 
0  <  2  <  1,  in  the  downward  direction. 

We  write  the  cone  as  a  vector  function:  r  =  (f  cos m, f  sinu, f ),  0  <  w  <  27r  and 
0  <  f  <  1.    Then       =  (— f  sinu,  f  cos  w,  0)  and  r„  =  (cos  w,  sinu,  1)  and  r^,  x  r„  = 


Figure  16.7.1     A  Mobius  strip. 


D 


D 


D 
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(f  cosM,  V  sinw,  —v).  The  third  coordinate  —v  is  negative,  which  is  exactly  what  we  desire, 
that  is,  the  normal  vector  points  down  through  the  surface.  Then 


/»27r     /•!  /•27r  /»1 

/  /  {x,y,  z"^)  ■  {v  cosu^v  s'mu, —v)  dv  du  =  /  /  xv  cos  u  +  yv  s'mu  —  z'^v  dv  du 
Jo    Jo  Jo  Jo 


271-  pi 

2        2       I      2    •    2  5  J  J 

V  COS  w  +  f  Sin  u  —  V  dv  du 


0  JO 
271-  ,.1 


0  ^0 


2         5   J  7 

f  —     dv  du  =  —. 


□ 


Exercises  16.7. 

1.  Find  the  center  of  mass  of  an  object  that  occupies  the  upper  hemisphere  of  x'^  +  y'^  +  z'^  =  1 
and  has  density  a;^  +  y^.  =J> 

2.  Find  the  center  of  mass  of  an  object  that  occupies  the  surface  z  =  xy,  0<a;<l,0<y<l 
and  has  density       +  x^  +  y^. 

3.  Find  the  center  of  mass  of  an  object  that  occupies  the  surface  z  =  \/ x"^  +  y^,  1  <    <  4  and 
has  density  x'^z.  =^ 

4.  Find  the  centroid  of  the  surface  of  a  right  circular  cone  of  height  h  and  base  radius  r,  not 
including  the  base.  =^ 

5.  Evaluate  J j (2,  -3,4)  •  NdS,  where  D  is  given  by  z  =      +  y^,  -1  <  a;  <  1,  -1  <  y  <  1, 

D 

oriented  up. 

6.  Evaluate  j  j  {x,  y,  3)  •  N  dS,  where  D  is  given  by  2  =  3^;  —  5y,  l<x<2,  0<y<2,  oriented 

D 

up.  =^> 

7.  Evaluate  JJ {x,  y,  —2)  •  N  dS,  where  D  is  given  by  z  =  1  —      —  y'^,  a;^  +  y'^  <  1,  oriented  up. 

D 

8.  Evaluate  JJ  {xy,  yz,  zx)  ■  ISi  dS,  where  D  is  given  byz  =  x  +  y^  +  2,  0<a;<l,  a;<y<l, 

D 

oriented  up. 

9.  Evaluate  JJ {e^ ,    ,z)  •  N  dS,  where  D  is  given  hy  z  =  xy,  0  <  x  <  1,  —x  <  y  <  x,  oriented 

D 

up. 

r  r 

10.  Evaluate  /  /  {xz,  yz,  z)  ■  N  dS,  where  D  is  given  by^  =  a  —  x  —  y  ,  x  +  y  <  b  ,  oriented 

D 

up.  =^> 

11.  A  fluid  has  density  870  kg/m^  and  flows  with  velocity  v  =  {z,y'^ ,  x'^) ,  where  distances  are 
in  meters  and  the  components  of  v  are  in  meters  per  second.  Find  the  rate  of  flow  outward 
through  the  portion  of  the  cylinder  x  +y   =4,  0<z<l  for  which  y  >  0.  =^> 
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12.  Gauss's  Law  says  that  the  net  charge,  Q,  enclosed  by  a  closed  surface,  5,  is 

Q  =  eo  JJ  E-NdS 

where  E  is  an  electric  field  and  eo  (the  permittivity  of  free  space)  is  a  known  constant;  N  is 
oriented  outward.  Use  Gauss's  Law  to  find  the  charge  contained  in  the  cube  with  vertices 
(±1,  ±1,  ±1)  if  the  electric  field  is  E  =  {x,  y,  z).  =^ 

16.8    Stokes's  Theorem 

Recall  that  one  version  of  Green's  Theorem  (see  equation  16.5.1)  is 


/    F-rfr  =  //(V  X  F)-kdA. 

JdD  J  J 


D 

Here  D  is  a  region  in  the  x-y  plane  and  k  is  a  unit  normal  to  D  at  every  point.  If  D  is 
instead  an  orientable  surface  in  space,  there  is  an  obvious  way  to  alter  this  equation,  and 
it  turns  out  still  to  be  true: 

THEOREM  16.8.1  Stokes's  Theorem  Provided  that  the  quantities  involved  are 
sufficiently  nice,  and  in  particular  if  D  is  orientable. 


[    Fdr=  [[{V  X  F)-N  dS, 

JdD  J  J 


D 

if  dD  is  oriented  counter-clockwise  relative  to  N.  □ 

Note  how  little  has  changed:  k  becomes  N,  a  unit  normal  to  the  surface,  and  dA 
becomes  dS,  since  this  is  now  a  general  surface  integral.  The  phrase  "counter-clockwise 
relative  to  N"  means  that  if  we  take  the  direction  of  N  to  be  "up" ,  then  we  go  around  the 
boundary  counter-clockwise  when  viewed  from  "above" . 


EXAMPLE  16.8.2  Let  F  =  (e^^  cos  z,  x^z,  xy)  and  the  surface  Dhex=  ^l-y"^  -  2^, 
oriented  in  the  positive  x  direction.  It  quickly  becomes  apparent  that  the  surface  integral 
in  Stokes's  Theorem  is  intractable,  so  we  try  the  line  integral.  The  boundary  of  D  is  the 
unit  circle  in  the  y-z  plane,  r  =  (0,  cosm,  sinw),  0  <  w  <  271.  The  integral  is 

^271-  i'2-K 

I     {e^^  COS  z.x"^ z,xy)  ■  {0,  —  smu.cos  u)  du  =  /     Odu  =  0, 
Jo  Jo 

because  a;  =  0.  □ 
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An  interesting  consequence  of  Stokes's  Theorem  is  that  if  D  and  E  are  two  orientable 
surfaces  with  the  same  boundary,  then 

{VxF)-NdS=[   F-dr=  [   F  ■  dr  ^  [  [  {V  x  F)  ■  N  dS. 

JdD  JdE  J  J 

D  E 

Sometimes  both  of  the  integrals 


//(V  X  F) 


D 


■NdS       and        /  F-dr 

dD 


are  difficult,  but  you  may  be  able  to  find  a  second  surface  E  so  that 

NdS 


II  iV  X  F) 


E 

has  the  same  value  but  is  easier  to  compute. 

EXAMPLE  16.8.3  In  the  previous  example,  the  line  integral  was  easy  to  compute. 
But  we  might  also  notice  that  another  surface  E  with  the  same  boundary  is  the  flat  disk 
y'^  +  z'^  <  1.  The  unit  normal  N  for  this  surface  is  simply  i  =  (1,  0,  0).  We  compute  the 
curl: 

V  X  F  =  (a;  -  a;^,  -e^^sinz  -  y,2xz  -  xe'^^  cosz). 
Since  x  —  0  everywhere  on  the  surface, 

(V  X  F)  •  N  =  (0,  -e^^  sin  z-y,  2xz  -  xe^^  cos     •  (1,  0, 0)  =  0, 

so  the  surface  integral  is 

JJods  =  o, 

E 

as  before.  In  this  case,  of  course,  it  is  still  somewhat  easier  to  compute  the  line  integral, 
avoiding  V  x  F  entirely.  □ 

EXAMPLE  16.8.4  Let  F  =  {-y^,x,  z^),  and  let  the  curve  C  be  the  intersection  of  the 
cylinder  x^  +  y^  =  1  with  the  plane  y  +  z  =  2,  oriented  counter-clockwise  when  viewed 

from  above.  We  compute  /  F  •      in  two  ways. 

Jc 
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First  we  do  it  directly:  a  vector  function  for  C  is  r  =  (cos  tt,  sin  tt,  2  —  sintt),  so 
r'  =  (— sinMjCOSM,  —  costt),  and  the  integral  is  then 


27r  /.27r 

2  ,   ^„„„„.       „2  „„„„.-._    /       „•  3, 


y  sin  u  -\-  X  cos  u  —  z  cos  udu  —  /     sin  u  +  cos  u  —  {2  —  sin  u)  cos  udu  —  tt. 

0 

To  use  Stokes's  Theorem,  we  pick  a  surface  with  C  as  the  boundary;  the  simplest 
such  surface  is  that  portion  of  the  plane  y  +  z  =  2  inside  the  cylinder.  This  has  vector 
equation  r  =  {vcosu,vsmu,2  —  vsinu).  We  compute  r„  =  {—vsmu,vcosu,—vcosu), 
Ty  =  (costt,sintt,  —  sinw),  and  r^j  x  r^,  =  {0,  —v,  —v) .  To  match  the  orientation  of  C  we 
need  to  use  the  normal  (0,  v,  v).  The  curl  of  F  is  (0,  0, 1  +  2y)  =  (0,  0,l  +  2v  sintt),  and  the 
surface  integral  from  Stokes's  Theorem  is 

/'27T      I*  1 

/      /  {1 -\-2vsm.u)v  dv  du  =  TT. 
Jo  Jo 

In  this  case  the  surface  integral  was  more  work  to  set  up,  but  the  resulting  integral  is 
somewhat  easier.  □ 

Proof  of  Stokes's  Theorem.  We  can  prove  here  a  special  case  of  Stokes's  Theorem, 
which  perhaps  not  too  surprisingly  uses  Green's  Theorem. 

Suppose  the  surface  D  of  interest  can  be  expressed  in  the  form  z  =  g{x,y),  and  let 
F  =  {P,Q,R).  Using  the  vector  function  r  =  {x,y,  g{x,y))  for  the  surface  we  get  the 
surface  integral 

JJ  VxF-dS  =  JJ{Ry-Q„P,-R^,Q^-Py)-{-g^,-gy,l)dA 

D  E 

=  J  J  -Ry9x  +  Qz9x  -  Pz9y  +  RxQy  +  Qx  -  Py  dA. 
E 

Here  E  is  the  region  in  the  x-y  plane  directly  below  the  surface  D. 

For  the  line  integral,  we  need  a  vector  function  for  dD.  If  {x{t),y(t))  is  a  vector 
function  for  dE  then  we  may  use  r{t)  =  {x{t),y{t), g{x{t),y{t)))  to  represent  dD.  Then 

Jqjj  7^      dt        dt        dt         Jg^      dt        dt         \ox  dt      oy  dt  J 

using  the  chain  rule  for  dz/dt.  Now  we  continue  to  manipulate  this: 

^dx     ^dy    ^  f  dz  dx     dz  dy\  , 


dt 


dt        dt       \dx  dt      dy  dt 


dE 


dz  \  I  dz  \ 

P  +  R—        +    Q  +  R—  dy, 


dx  J  \  dy 
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which  now  looks  just  hke  the  hne  integral  of  Green's  Theorem,  except  that  the  functions 
P  and  Q  of  Green's  Theorem  have  been  replaced  by  the  more  complicated  P  +  R{dz/dx) 
and  Q  +  R{dz/dy).  We  can  apply  Green's  Theorem  to  get 


dE 


P  +  R^)  dx+fq  +  R^)  dy=  [[  ^(q  +  R^)  -  I  (P  +  R^^]  dA. 


dx  J  \  dy  J  J  J  dx  \  dy  J     dy  \  dx 


E 


Now  we  can  use  the  chain  rule  again  to  evaluate  the  derivatives  inside  this  integral,  and  it 
becomes 

Qx  +  QzQx  +  RxQy  +  Rz9x9y  +  RQyx  -  {Py  +  PzQy  +  RyQx  +  RzQyQx  +  RQxy)  dA 
Qx      Qz9x      Rx9y  -Pz9y      Ry9x  dA, 


E 


E 

which  is  the  same  as  the  expression  we  obtained  for  the  surface  integral.  ■ 
Exercises  16.8. 

1.  Let  F  =  {z,x,y).  The  plane  z  =  2x  +  2y  —  1  and  the  paraboloid  z  =      +  y"^  intersect  in  a 
closed  curve.  Stokes's  Theorem  implies  that 

JJ{V  xF)  -NdS  =  j  F-dY  =  JJ  {V  xF)  -  -NdS, 

where  the  line  integral  is  computed  over  the  intersection  C  of  the  plane  and  the  paraboloid, 
and  the  two  surface  integrals  are  computed  over  the  portions  of  the  two  surfaces  that  have 
boundary  C  (provided,  of  course,  that  the  orientations  all  match).  Compute  all  three  inte- 
grals. 

2.  Let  D  be  the  portion  oi  z  =  1  —      —       above  the  x-y  plane,  oriented  up,  and  let  F  = 
{xy^,-x'^y,xyz).  Compute  J  J  (V  x  F)  •NdS'.  =^ 

D 

3.  Let  D  be  the  portion  oi  z  =  2x  +  5y  inside      +      =  1,  oriented  up,  and  let  F  =  (y,  z,  —x). 
Compute  /     F  •  dr.  =^ 

JdD 

4.  Compute  (p  x^ z  dx  +  3xdy  —  y^  dz,  where  C  is  the  unit  circle  x^  +  y'^  =  1  oriented  counter- 


Jc 

clockwise.  =^> 


5.  Let  D  be  the  portion  oi  z  =  px  +  qy  +  r  over  a  region  in  the  x-y  plane  that  has  area  A, 
oriented  up,  and  let  F  =  {ax  +  by  +  cz,  ax  +  by  +  cz,  ax  +  by  +  cz).  Compute  /     F  •  dr.  =^ 

JdD 

6.  Let  D  be  any  surface  and  let  F  =  {P(x),Q{y),  R{z))  {P  depends  only  on  x,  Q  only  on  y, 
and  R  only  on  z).  Show  that  /     F  •  dr  =  0. 

JdD 
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7.  Show  that   /  fVg  +  gV f  •  dr  =  0,  where  r  describes  a  closed  curve  C  to  which  Stokes's 
Jc 

Theorem  appHes. 

16.9    The  Divergence  Theorem 

The  third  version  of  Green's  Theorem  (equation  16.5.2)  we  saw  was: 

F-Ncis=  /  /  V-Fc/A 


D 


'dD 

With  minor  changes  this  turns  into  another  equation,  the  Divergence  Theorem: 


THEOREM  16.9.1  Divergence  Theorem  Under  suitable  conditions,  if  i?  is  a 
region  of  three  dimensional  space  and  D  is  its  boundary  surface,  oriented  outward,  then 

F-NdS  =  jjj  V  -YdV. 

D  E 

Proof.  Again  this  theorem  is  too  difficult  to  prove  here,  but  a  special  case  is  easier.  In 
the  proof  of  a  special  case  of  Green's  Theorem,  we  needed  to  know  that  we  could  describe 
the  region  of  integration  in  both  possible  orders,  so  that  we  could  set  up  one  double  integral 
using  dxdy  and  another  using  dydx.  Similarly  here,  we  need  to  be  able  to  describe  the 
three-dimensional  region  E  in  different  ways. 
We  start  by  rewriting  the  triple  integral: 

V-FdV  =  JJJ{P:.  +  Qy  +  R,)dV  =  JJJ  P,dV  +  JJJ  QydV  +  JJJ  R.dV. 

EE  E  E  E 

The  double  integral  may  be  rewritten: 

J  J  ¥-^dS  =  J  J  (Pi  +  Qj  +  i?k)-Nrf5  =  J  J  P\-^dS  +  J  J  Qy^dS  +  J  J  Rk-NdS. 

D  D  D  D  D 

To  prove  that  these  give  the  same  value  it  is  sufficient  to  prove  that 

Pi-NdS  =  JJJ  P^dV, 

D  E 

Q]-NdS  =  JJJ  QydV,  and  (16.9.1) 

D  E 


II 


Rk-NdS  =  JJJ  R,dV. 

D  E 

Not  surprisingly,  these  are  all  pretty  much  the  same;  we'll  do  the  first  one. 
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We  set  the  triple  integral  up  with  dx  innermost: 

''92{y,z) 
' 9i{y,z) 


rrr  rr   r92{y,z)  t- t- 

P^dV=  //   /  P^dxdA=  P{g2iy,z),y,z)-P{gi{y,z),y,z)dA, 

J  J  J  J  J     J      ixi^z)  J  J 


E  B 

where  B  is  the  region  in  the  y-z  plane  over  which  we  integrate.  The  boundary  surface  of 
E  consists  of  a  "top"  x  =  g2{y,z),  a  "bottom"  x  =  gi{y,z),  and  a  "wrap-around  side" 
that  is  vertical  to  the  y-z  plane.  To  integrate  over  the  entire  boundary  surface,  we  can 
integrate  over  each  of  these  (top,  bottom,  side)  and  add  the  results.  Over  the  side  surface, 
the  vector  N  is  perpendicular  to  the  vector  i,  so 

Pi-NdS^  JJ  OdS  =  0. 

side  side 

Thus,  we  are  left  with  just  the  surface  integral  over  the  top  plus  the  surface  integral 
over  the  bottom.  For  the  top,  we  use  the  vector  function  r  —  {g2{y,z),y^z)  which  gives 
X     =  (1,  —g2yi  —g2z)',  the  dot  product  of  this  with  i  =  (1,  0, 0)  is  1.  Then 


top  B 

In  almost  identical  fashion  we  get 


JJ  Pi-NdS  =  JJ  P{g2{y,z),y,z)dA. 


JJ    Pi-NdS^- JJ  P{g,{y,z),y,z)dA, 


bottom  B 

where  the  negative  sign  is  needed  to  make  N  point  in  the  negative  x  direction.  Now 


Pi-NdS^  JJ  P{g2iy,z),y,z)dA- J  J  P{g^{y,z),y,z)dA, 

D  B  B 

which  is  the  same  as  the  value  of  the  triple  integral  above.  ■ 

EXAMPLE  16.9.2     Let  F  =  (2x,3y,2^),  and  consider  the  three-dimensional  volume 
inside  the  cube  with  faces  parallel  to  the  principal  planes  and  opposite  corners  at  (0,  0,  0) 
and  (1,1,1).  We  compute  the  two  integrals  of  the  divergence  theorem. 
The  triple  integral  is  the  easier  of  the  two: 

/    /  2  +  Z  +  2zdxdydz  =  Q. 
0  Jo  Jo 

The  surface  integral  must  be  separated  into  six  parts,  one  for  each  face  of  the  cube.  One 
face  is  2;  =  0  or  r  =  (tt,    0),  0  <  u,v  <  1.   Then       =  (1,0,0),  r^,  =  (0,1,0),  and 
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X  r^,  =  (0,0,1).  We  need  this  to  be  oriented  downward  (out  of  the  cube),  so  we  use 
(0,0,  —1)  and  the  corresponding  integral  is 

1   .1  .1  .1 

/    —z  dudv  =        /    Odudv  =  0. 
0  Jo  Jo  Jo 

Another  face  is  y  =  1  or  r  =  (w,  1,  v).  Then  =  (1, 0, 0),  r„  =  (0,  0, 1),  and  r^^  x  = 
(0,  —  1,  0).  We  need  a  normal  in  the  positive  y  direction,  so  we  convert  this  to  (0, 1,0),  and 
the  corresponding  integral  is 

1      r-l  r-l  r-1 

/    3ydudv=        /   3  dudv  =  3. 
0  Jo  Jo  Jo 

The  remaining  four  integrals  have  values  0,  0,  2,  and  1,  and  the  sum  of  these  is  6,  in 
agreement  with  the  triple  integral.  □ 

EXAMPLE  16.9.3  Let  F  =  {x^,  y^,  2;^),  and  consider  the  cylindrical  volume  x'^+y'^  <  9, 
0  <  2;  <  2.  The  triple  integral  (using  cylindrical  coordinates)  is 


/  /  /  {3r'^ +  2z)rdzdrdd  =  2797r. 
Jo    Jo  Jo 


For  the  surface  we  need  three  integrals.  The  top  of  the  cylinder  can  be  represented 
by  r  =  {v  cosM,  v  smu,  2);  x  r^,  =  (0,  0,  —v),  which  points  down  into  the  cylinder,  so  we 
convert  it  to  (0,0,^;).  Then 

/»27r     rS  /'27r 

/      /  {v^  cos^  u,     sin^  u,  4:)  ■  {0,0,  v)dvdu=  /      /  4:vdvdu^36n. 
Jo    Jo  Jo  Jo 

The  bottom  is  r  =  {v  cos  u,  v  sin  u,  0) ;  r„  x      =  (0,  0,  —v)  and 

/      /  {v^  cos^  u,     sin^  u,  0)  •  (0,  0,  -v)  dv  du  ^         /  Odvdu^O. 
Jo    Jo  Jo  Jo 

The  side  of  the  cylinder  is  r  =  (Scosm,  3  sinw,  w );  x  r„  =  (Scosw,  Ssin-u,  0)  which  does 
point  outward,  so 

/      /  {27 cos^  u,  27 sin^  u,v^)  ■  {3cosu,3smu,0)  dvdu 
Jo  Jo 

f-2-n:  f-2 

=         /   Slcos^M  +  Slsin^Mdvdw  =  2437r. 
JO  Jo 

The  total  surface  integral  is  thus  367r  +  0  +  2437r  =  2797r.  □ 
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Exercises  16.9. 

1.  Using  F  =  (3x,  y^,  —2z^)  and  the  region  bounded  by     +     =  9,  2;  =  0,  and  z  =  5,  compute 
both  integrals  from  the  Divergence  Theorem. 

2.  Let  E  be  the  volume  described  byO<2;<a,  0<y<6,  0<z<c,  and  F  =  (a;^,  j/^,  2^). 
Compute  [ I  F  •  N  dS.  ^ 


9£; 


3.  Let  ii^  be  the  volume  described  byO<x<l,0<y<l,0<z<l,  and  F  = 
(2xy,3xy,2e"+^).  Compute  jj  F-NdS.  ^ 

4.  Let  E  be  the  volume  described  byO<x<l,  0<y<a;,  0<z<a;  +  y,  and  F  =  (x,  2y,  3^;). 
Compute        F-Nd5.  ^ 

5.  Let  E  be  the  volume  described  by      +  y'^  +      <  4,  and  F  =  {x^ ,  y^,  z^).  Compute  j j  F  • 

6.  Let  ii^  be  the  hemisphere  described  by  0  <  2;  <  -^1  —  a;^  —  y^,  and 

F  =  (^x2  +y2  +  z2^         +y2  +z2,  ^x2  +  y2  +  22).  Compute  yj  F  ■  N  dS.  ^ 


dE 

7.  Let  iiJ  be  the  volume  described  by  a;  +y   <1,0<2;<4,  and  F  =  {xy  ,  yz,  x  z).  Compute 
F-NdS. 


dE 


8.  Let  E  be  the  solid  cone  above  the  x-y  plane  and  inside  z  =  1  —  \/ x"^  +  y^,  and  F 
(x  cos^  2,  y  sin^  z,  ^x'^  +  y'^z) .  Compute  jj  F-NdS. 


dE 


9.  Prove  the  other  two  equations  in  the  display  16.9.1. 
10.  Suppose  D  is  a  closed  surface,  and  that  D  and  F  are  sufficiently  nice.  Show  that 

'VxF)-Nd5  =  0 


D 

where  N  is  the  outward  pointing  unit  normal. 

11.  Suppose  D  is  a  closed  surface,  D  is  sufficiently  nice,  and  F  =  {a,b,c)  is  a  constant  vector 
field.  Show  that 

jj  F-NdS  =  0 

D 

where  N  is  the  outward  pointing  unit  normal. 

12.  We  know  that  the  volume  of  a  region  E  may  often  be  computed  as  jjj  dxdydz.  Show  that 

E 

this  volume  may  also  be  computed  as  ^  jj  {x,y,  z)      dS  where  N  is  the  outward  pointing 

dE 

unit  normal  to  dE. 


17 

Differential  Equations 


Many  physical  phenomena  can  be  modeled  using  the  language  of  calculus.  For  example, 
observational  evidence  suggests  that  the  temperature  of  a  cup  of  tea  (or  some  other  liquid) 
in  a  room  of  constant  temperature  will  cool  over  time  at  a  rate  proportional  to  the  difference 
between  the  room  temperature  and  the  temperature  of  the  tea. 

In  symbols,  if  t  is  the  time,  M  is  the  room  temperature,  and  f{t)  is  the  temperature 
of  the  tea  at  time  t  then  f'{t)  =  k{M  —  f{t))  where  /c  >  0  is  a  constant  which  will  depend 
on  the  kind  of  tea  (or  more  generally  the  kind  of  liquid)  but  not  on  the  room  temperature 
or  the  temperature  of  the  tea.  This  is  Newton's  law  of  cooling  and  the  equation  that 
we  just  wrote  down  is  an  example  of  a  differential  equation.  Ideally  we  would  like  to 
solve  this  equation,  namely,  find  the  function  f(t)  that  describes  the  temperature  over 
time,  though  this  often  turns  out  to  be  impossible,  in  which  case  various  approximation 
techniques  must  be  used.  The  use  and  solution  of  differential  equations  is  an  important 
field  of  mathematics;  here  we  see  how  to  solve  some  simple  but  useful  types  of  differential 
equation. 

Informally,  a  differential  equation  is  an  equation  in  which  one  or  more  of  the  derivatives 
of  some  function  appear.  Typically,  a  scientific  theory  will  produce  a  differential  equation 
(or  a  system  of  differential  equations)  that  describes  or  governs  some  physical  process,  but 
the  theory  will  not  produce  the  desired  function  or  functions  directly. 

Recall  from  section  6.2  that  when  the  variable  is  time  the  derivative  of  a  function  y(t) 
is  sometimes  written  as  y  instead  of  y';  this  is  quite  common  in  the  study  of  differential 
equations. 
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17.1    First  Order  Differential  Equations 

We  start  by  considering  equations  in  which  only  the  first  derivative  of  the  function  appears. 

DEFINITION  17.1.1  A  first  order  differential  equation  is  an  equation  of  the  form 
F{t,  y,  y)  =  0.  A  solution  of  a  first  order  differential  equation  is  a  function  f{t)  that  makes 
F{t^  f{t),  f'{t))  =  0  for  every  value  of  t.  a 

Here,  F  is  a  function  of  three  variables  which  we  label  t,  and  y.  It  is  understood 
that  y  will  explicitly  appear  in  the  equation  although  t  and  y  need  not.  The  term  "first 
order"  means  that  the  first  derivative  of  y  appears,  but  no  higher  order  derivatives  do. 

EXAMPLE  17.1.2  The  equation  from  Newton's  law  of  cooling,  y  =  k{M  —  y)  is  a  first 
order  differential  equation;  F{t,y,y)  =  k{M  —  y)  —  y.  □ 

EXAMPLE  17.1.3  ij  =  +  1  is  a  first  order  differential  equation;  F{t,y,y)  =  y-t^-l. 
All  solutions  to  this  equation  are  of  the  form  t"^/3  -\-t  -\-  C.  □ 

DEFINITION  17.1.4  A  first  order  initial  value  problem  is  a  system  of  equations 
of  the  form  F{t,y,y)  =  0,  y{to)  =  j/o-  Here  to  is  a  fixed  time  and  yo  is  a  number.  A 
solution  of  an  initial  value  problem  is  a  solution  f{t)  of  the  differential  equation  that  also 
satisfies  the  initial  condition  /(to)  =  Vo-  □ 

EXAMPLE  17.1.5  The  initial  value  problem  y  =  +  1,  y(l)  =  4  has  solution  f{t)  = 
t^/3  +  t +  8/3.  □ 

The  general  first  order  equation  is  rather  too  general,  that  is,  we  can't  describe  methods 
that  will  work  on  them  all,  or  even  a  large  portion  of  them.  We  can  make  progress  with 
specific  kinds  of  first  order  differential  equations.  For  example,  much  can  be  said  about 
equations  of  the  form  y  =  (t){t,  y)  where  0  is  a  function  of  the  two  variables  t  and  y.  Under 
reasonable  conditions  on  such  an  equation  has  a  solution  and  the  corresponding  initial 
value  problem  has  a  unique  solution.  However,  in  general,  these  equations  can  be  very 
difficult  or  impossible  to  solve  explicitly. 

EXAMPLE  17.1.6  Consider  this  specific  example  of  an  initial  value  problem  for  New- 
ton's law  of  cooling:  y  =  2(25  —  y),  y{0)  =  40.  We  first  note  that  if  y{to)  =  25,  the  right 
hand  side  of  the  differential  equation  is  zero,  and  so  the  constant  function  y{t)  =  25  is  a 
solution  to  the  differential  equation.  It  is  not  a  solution  to  the  initial  value  problem,  since 
y{0)  7^  40.  (The  physical  interpretation  of  this  constant  solution  is  that  if  a  liquid  is  at 
the  same  temperature  as  its  surroundings,  then  the  liquid  will  stay  at  that  temperature.) 
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So  long  as  y  is  not  25,  we  can  rewrite  the  differential  equation  as 

dy  I 


dt25-y 
1 


25 -y 

so 


dy  =  2  dt, 


^     dy^  I  2  dt, 


25 -y 

that  is,  the  two  anti-dcrivativcs  must  be  the  same  except  for  a  constant  difference.  We  can 
calculate  these  anti-derivatives  and  rearrange  the  results: 


(-l)ln|25-y|  =  2t  +  Cq 

In  |25  -y\  =  -2t  -  Cq  = -2t  +  C 

|25-y|=e-^*+^  =  e-2*e^ 


y-25  =  ±e^e~^* 


y  =  25±  =  25  +  Ae'^*. 

Here  A  —  ±e'~^  =  ±e~*^°  is  some  non-zero  constant.  Since  we  want  y(0)  =  40,  we 
substitute  and  solve  for  A: 

40  =  25  + 
15  =  A, 

and  soy  =  25  +  15e~^*  is  a  solution  to  the  initial  value  problem.  Note  that  y  is  never  25,  so 
this  makes  sense  for  all  values  of  t.  However,  if  we  allow  ^  =  0  we  get  the  solution  y  =  25 
to  the  differential  equation,  which  would  be  the  solution  to  the  initial  value  problem  if  we 
were  to  require  y(0)  =  25.  Thus,  y  =  25  +  Ae~'^*  describes  all  solutions  to  the  differential 
equation  y  =  2(25  —  y),  and  all  solutions  to  the  associated  initial  value  problems.  □ 

Why  could  we  solve  this  problem?  Our  solution  depended  on  rewriting  the  equation 
so  that  all  instances  of  y  were  on  one  side  of  the  equation  and  all  instances  of  t  were  on  the 
other;  of  course,  in  this  case  the  only  t  was  originally  hidden,  since  we  didn't  write  dy/dt 
in  the  original  equation.  This  is  not  required,  however. 
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EXAMPLE  17.1.7  Solve  the  differential  equation  y  =  2t{25  -  y).  This  is  almost 
identical  to  the  previous  example.  As  before,  y{t)  =  25  is  a  solution.  \i  y  ^  25, 


(-l)ln|25-y|  =t2  +  Co 


In  |25  -y\  =  -t^  -  Cq  = -t^  +  C 
|25-y|  =e"*'+^  =  e"*'e^ 
y  -  25  =  ±  e^e~*' 

2/  =  25  ±  e^e-*'  =  25  +  Ae"*' . 

_  ,2 

As  before,  all  solutions  are  represented  by  y  =  25  +  ,  allowing  A  to  be  zero.  □ 

DEFINITION  17.1.8  A  first  order  differential  equation  is  separable  if  it  can  be  written 
in  the  form  y  =  f(t)g{y).  □ 

As  in  the  examples,  we  can  attempt  to  solve  a  separable  equation  by  converting  to  the 
form 


This  technique  is  called  separation  of  variables.  The  simplest  (in  principle)  sort  of 
separable  equation  is  one  in  which  g{y)  =  1,  in  which  case  we  attempt  to  solve 


j^dy  =  j  f{t)dt. 


We  can  do  this  if  we  can  find  an  anti-derivative  of  f{t). 

Also  as  we  have  seen  so  far,  a  differential  equation  typically  has  an  infinite  number 
of  solutions.  Ideally,  but  certainly  not  always,  a  corresponding  initial  value  problem  will 
have  just  one  solution.  A  solution  in  which  there  are  no  unknown  constants  remaining  is 
called  a  particular  solution. 

The  general  approach  to  separable  equations  is  this:  Suppose  we  wish  to  solve  y  = 
fit) gin)  where  /  and  g  are  continuous  functions.  If  g{a)  =  0  for  some  a  then  y{t)  =  a 
is  a  constant  solution  of  the  equation,  since  in  this  case  y  =  0  =  f{t)g{a).  For  example, 
y  =  y"^  —  1  has  constant  solutions  y{t)  =  1  and  y{t)  —  —1. 

To  find  the  nonconstant  solutions,  we  note  that  the  function  l/g{y)  is  continuous  where 
g  ^  0,  so  1/g  has  an  antiderivative  G.  Let  F  be  an  antiderivative  of  /.  Now  we  write 

G{y)  ^  J  -^-^dy^J  m  dt  =  F{t)  +  C, 

so  G{y)  =  F{t)  +  C.  Now  we  solve  this  equation  for  y. 
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Of  course,  there  are  a  few  places  this  ideal  description  could  go  wrong:  we  need  to 
be  able  to  find  the  antiderivatives  G  and  F,  and  we  need  to  solve  the  final  equation  for 
y.  The  upshot  is  that  the  solutions  to  the  original  differential  equation  are  the  constant 
solutions,  if  any,  and  all  functions  y  that  satisfy  G{y)  =  F{t)  +  C. 

EXAMPLE  17.1.9  Consider  the  differential  equation  y  =  ky.  When  /c  >  0,  this  de- 
scribes certain  simple  cases  of  population  growth:  it  says  that  the  change  in  the  population 
y  is  proportional  to  the  population.  The  underlying  assumption  is  that  each  organism  in 
the  current  population  reproduces  at  a  fixed  rate,  so  the  larger  the  population  the  more 
new  organisms  are  produced.  While  this  is  too  simple  to  model  most  real  populations,  it 
is  useful  in  some  cases  over  a  limited  time.  When  /c  <  0,  the  differential  equation  describes 
a  quantity  that  decreases  in  proportion  to  the  current  value;  this  can  be  used  to  model 
radioactive  decay. 

The  constant  solution  is  y{t)  =  0;  of  course  this  will  not  be  the  solution  to  any 
interesting  initial  value  problem.  For  the  non-constant  solutions,  we  proceed  much  as 
before: 


/- 

/  y 

dy 

=  J  kdt 

In 

\y\ 

=  kt  +  C 

\y\ 

=  e'^'e^ 

y 

y 

=  Ae^*. 

Again,  if  we  allow  A  =  0  this  includes  the  constant  solution,  and  we  can  simply  say  that 
y  =  Ae^^  is  the  general  solution.  With  an  initial  value  we  can  easily  solve  for  A  to  get 
the  solution  of  the  initial  value  problem.  In  particular,  if  the  initial  value  is  given  for  time 
t  —  0,  y{0)  =  yo,  then  A  =  yo  and  the  solution  is  y  =  yoe'^^.  □ 

Exercises  17.1. 

1.  Which  of  the  following  equations  are  separable? 

a.  y  =  sin(ty) 

b.  ij  =  e 

c.  yy  =  t 

d.  y  =  (t^  —  t)  arcsin(y) 

e.  y  =     In  y  +  4t^  In  y 

2.  Solve  y  =  1/(1  +  ^ 

3.  Solve  the  initial  value  problem  ij  =  t"'  with  y(0)  =  1  and  n  >  0.  =^ 

4.  Solve  y  =  In  t.  =^ 
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5.  Identify  the  constant  solutions  (if  any)  of  y  =  tsiny.  =^ 

6.  Identify  the  constant  solutions  (if  any)  of  y  =  te^ .  =^ 

7.  Solve  y  =  t/y.  =^ 

8.  Solve  y  =  y^  -  1. 

9.  Solve  y  =  t/{y^  -  5).  You  may  leave  your  solution  in  implicit  form:  that  is,  you  may  stop 
once  you  have  done  the  integration,  without  solving  for  y.  =^ 

10.  Find  a  non-constant  solution  of  the  initial  value  problem  y  =  y^^^,  y(0)  =  0,  using  separation 
of  variables.  Note  that  the  constant  function  y(t)  =  0  also  solves  the  initial  value  problem. 
This  shows  that  an  initial  value  problem  can  have  more  than  one  solution.  =^> 

11.  Solve  the  equation  for  Newton's  law  of  cooling  leaving  M  and  k  unknown.  =^> 

12.  After  10  minutes  in  Jean-Luc's  room,  his  tea  has  cooled  to  40°  Celsius  from  100°  Celsius. 
The  room  temperature  is  25°  Celsius.  How  much  longer  will  it  take  to  cool  to  35°?  =^ 

13.  Solve  the  logistic  equation  y  =  ky[M  —  y).  (This  is  a  somewhat  more  reasonable  population 
model  in  most  cases  than  the  simpler  y  =  ky.)  Sketch  the  graph  of  the  solution  to  this 
equation  when  M  =  1000,  k  =  0.002,  y(0)  =  1.  ^ 

14.  Suppose  that  y  =  ky,  y{0)  =  2,  and  y(0)  =  3.  What  is  y?  =^> 

15.  A  radioactive  substance  obeys  the  equation  y  =  ky  where  k  <  0  and  y  is  the  mass  of  the 
substance  at  time  t.  Suppose  that  initially,  the  mass  of  the  substance  is  y(0)  =  M  >  0.  At 
what  time  does  half  of  the  mass  remain?  (This  is  known  as  the  half  life.  Note  that  the  half 
life  depends  on  k  but  not  on  M.)  =^> 

16.  Bismuth-210  has  a  half  life  of  five  days.  If  there  is  initially  600  milligrams,  how  much  is  left 
after  6  days?  When  will  there  be  only  2  milligrams  left? 

17.  The  half  life  of  carbon-14  is  5730  years.  If  one  starts  with  100  milligrams  of  carbon-14, 
how  much  is  left  after  6000  years?  How  long  do  we  have  to  wait  before  there  is  less  than  2 
milligrams? 

18.  A  certain  species  of  bacteria  doubles  its  population  (or  its  mass)  every  hour  in  the  lab.  The 
differential  equation  that  models  this  phenomenon  is  y  =  ky,  where  fc  >  0  and  y  is  the 
population  of  bacteria  at  time  t.  What  is  y?  =^ 

19.  If  a  certain  microbe  doubles  its  population  every  4  hours  and  after  5  hours  the  total  population 
has  mass  500  grams,  what  was  the  initial  mass?  =^> 

17.2    First  Order  Homogeneous  Linear  Equations 

A  simple,  but  important  and  useful,  type  of  separable  equation  is  the  first  order  homo- 
geneous linear  equation: 

DEFINITION  17.2.1  A  first  order  homogeneous  hnear  differential  equation  is  one  of 
the  form  y  +  p{t)y  =  0  or  equivalently  y  =  —p{t)y.  □ 

"Linear"  in  this  definition  indicates  that  both  y  and  y  occur  to  the  first  power;  "ho- 
mogeneous" refers  to  the  zero  on  the  right  hand  side  of  the  first  form  of  the  equation. 


17.2    First  Order  Homogeneous  Linear  Equations  459 


EXAMPLE  17.2.2  The  equation  y  =  2t(25  -  y)  can  be  written  y  +  2ty  =  50t.  This 
is  hnear,  but  not  homogeneous.  The  equation  y  =  ky,  or  y  —  ky  =  0  is  hnear  and 
homogeneous,  with  a  particularly  simple  p{t)  =  —k.  □ 

Because  first  order  homogeneous  linear  equations  are  separable,  we  can  solve  them  in 
the  usual  way: 

y  =  -p{t)y 

J^dy^  J-p{t)dt 

\n\y\  =  P{t)  +  C 

where  P{t)  is  an  anti-derivative  of  —p{t).  As  in  previous  examples,  if  we  allow  ^  =  0  we 
get  the  constant  solution  y  =  0. 

EXAMPLE  17.2.3  Solve  the  initial  value  problems  y  +  ycost  =  0,  y{0)  =  1/2  and 
y{2)  =  1/2.  We  start  with 

P{t)  =  J —  costdt  = —sint, 
so  the  general  solution  to  the  differential  equation  is 


To  compute  A  we  substitute: 


so  the  solutions  is 


For  the  second  problem. 


so  the  solution  is 


1  -sint_ 
^  2 


A  —  lpSin2 


y=  ig^i'^^  -sint_ 

^2  □ 
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EXAMPLE  17.2.4  Solve  the  initial  value  problem  yy  +  3y  =  0,  y{l)  =  2,  assuming 
t  >  0.  We  write  the  equation  in  standard  form:  y  +  3y/t  =  0.  Then 


=  -31nt 


and 

y  =  Ae-^^""^  =  At-^. 

Substituting  to  find  A:  2  =  A{1)~^  =  A,  so  the  solution  is  y  =  2t~^.  □ 
Exercises  17.2. 

Find  the  general  solution  of  each  equation  in  1-4. 

1.  y  +  5y  =  0  ^ 

2.  y  -  2y  =  0  ^ 

4.  y  +  t'^y  =  0^ 

In  5-14,  solve  the  initial  value  problem. 

5.  y  +  y  =  0,  y(0)  =  4  ^ 

6.  y-3y  =  0,  y(l)  =  -2^ 

7.  y  +  ysint  =  0,  y(7r)  =  1  =^ 

8.  y  +  ye*  =  0,  y(0)  =  e  ^ 

9.  y  +  yVl  +  t4  =  0,  y(0)  =  0^ 

10.  y  +  ycos(e*)  =  0,  y(0)  =  0  ^ 

11.  ty-2y  =  0,  y(l)  =  4^ 

12.  i^y  +  y^o,  y(l)  =  -2,  t>0^ 

13.  i'y  =  2y,  y(l)  =  1,  t  >  0  ^ 

14.  i'y  =  2y,  y(l)  =  0,  t  >  0  ^ 

15.  A  function  y{t)  is  a  solution  of  y  +  fey  =  0.  Suppose  that  y(0)  =  100  and  y(2)  =  4.  Find  k 
and  find  y{t). 

16.  A  function  y{t)  is  a  solution  of  y  +  t'^y  =  0.  Suppose  that  y(0)  =  1  and  y(l)  =  e^^^.  Find  k 
and  find  y{t).  =^ 

17.  A  bacterial  culture  grows  at  a  rate  proportional  to  its  population.  If  the  population  is  one 
million  at  t  =  0  and  1.5  million  at  t  =  1  hour,  find  the  population  as  a  function  of  time. 

18.  A  radioactive  element  decays  with  a  half-life  of  6  years.  If  a  mass  of  the  element  weighs  ten 
pounds  at  t  =  0,  find  the  amount  of  the  element  at  time  t. 
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17.3    First  Order  Linear  Equations 

As  you  might  guess,  a  first  order  linear  differential  equation  has  the  form  y  +  p{t)y  —  f{t). 
Not  only  is  this  closely  related  in  form  to  the  first  order  homogeneous  linear  equation,  we 
can  use  what  we  know  about  solving  homogeneous  equations  to  solve  the  general  linear 
equation. 

Suppose  that  yi{t)  and  j/2(^)  are  solutions  to  y  +  p{t)y  =  f{t).  Let  g{t)  =  yi  —  j/2- 
Then 

g'{t)  +p{t)g{t)  =  y[-  -  ya) 

=  {y[+p{t)yi)-{y2+p{t)y2) 
=  fit)  -  fit)  =  0. 

In  other  words,  git)  =  yi  —  y2  is  a,  solution  to  the  homogeneous  equation  y  +  pit)y  =  0. 
Turning  this  around,  any  solution  to  the  linear  equation  y  +  pit)y  =  fit),  call  it  yi,  can 
be  written  as  y2  +  git),  for  some  particular  y2  and  some  solution  git)  of  the  homogeneous 
equation  y+pit)y  —  0.  Since  we  already  know  how  to  find  all  solutions  of  the  homogeneous 
equation,  finding  just  one  solution  to  the  equation  y  +  pit)y  =  fit)  will  give  us  all  of  them. 

How  might  we  find  that  one  particular  solution  to  y+pit)y  =  /(t)?  Again,  it  turns  out 
that  what  we  already  know  helps.  We  know  that  the  general  solution  to  the  homogeneous 
equation  y  +  pit)y  =  0  looks  like  Ae^^^\  We  now  make  an  inspired  guess:  consider  the 
function  v(t)e^^*\  in  which  we  have  replaced  the  constant  parameter  A  with  the  function 
vit).  This  technique  is  called  variation  of  parameters.  For  convenience  write  this  as 
sit)  =  vit)hit)  where  hit)  =  e^^^^  is  a  solution  to  the  homogeneous  equation.  Now  let's 
compute  a  bit  with  sit): 

s'it)  +  pit)sit)  =  vit)h'it)  +  v'it)hit)  +  pit)vit)hit) 
=  vit)ih'it)  +  pit)hit))  +  v'it)hit) 
=  v' it)  hit). 

The  last  equality  is  true  because  h'it)  +  pit)hit)  =  0,  since  hit)  is  a  solution  to  the 
homogeneous  equation.  We  are  hoping  to  find  a  function  s(t)  so  that  s'it)  +  pit) sit)  — 
fit);  we  will  have  such  a  function  if  we  can  arrange  to  have  =  fit),  that  is, 

v'it)  =  fit)/hit).  But  this  is  as  easy  (or  hard)  as  finding  an  anti-derivative  of 
Putting  this  all  together,  the  general  solution  to  y  +  pit)y  =  fit)  is 

vit)hit)  +  Ae^^'^  =  vit)e^^'^  +  Ae^^'\ 

EXAMPLE  17.3.1  Find  the  solution  of  the  initial  value  problem  y  +  3y/t  =  ,  y(l)  = 
1/2.  First  we  find  the  general  solution;  since  we  are  interested  in  a  solution  with  a  given 
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condition  at  t  =  1,  we  may  assume  t  >  0.  We  start  by  solving  the  homogeneous  equation 
as  usual;  call  the  solution  g: 

g  =  ^e-/(3/*)'^*  =  =  At-\ 

Then  as  in  the  discussion,  h{t)  =        and  v'{t)  =  t^/t~^  =  t^,  so  v{t)  =  We  know 

that  every  solution  to  the  equation  looks  like 

v{t)t-^  +  At-^  =  —t-^  +  At-^  =  —  +  At-^. 


Finally  we  substitute  to  find  A: 


1 

(1)3 

2  ~ 

6 

1 

A  = 

2  ~ 

The  solution  is  then 


+  A{l)-'  =  ^+A 

1  _  1 
6  ~  3" 


1  o 

»  =  6  +  3' 


□ 


Here  is  an  alternate  method  for  finding  a  particular  solution  to  the  differential  equation, 
using  an  integrating  factor.  In  the  differential  equation  y  +  p{t)y  —  f{t),  we  note  that 
if  we  multiply  through  by  a  function  I{t)  to  get  I{t)y  +  I{t)p{t)y  =  I{t)f{t),  the  left  hand 
side  looks  like  it  could  be  a  derivative  computed  by  the  product  rule: 

j^my)  =  my  +  r{t)y. 

Now  if  we  could  choose  I{t)  so  that  I'{t)  =  I{t)p{t),  this  would  be  exactly  the  left  hand  side 
of  the  differential  equation.  But  this  is  just  a  first  order  homogeneous  linear  equation,  and 

we  know  a  solution  is  I{t)  =  e^^*\  where  Q{t)  =  J pdt;  note  that  Q{t)  =  —P{t),  where 

P{t)  appears  in  the  variation  of  parameters  method  and  P'{t)  —  —p.  Now  the  modified 
differential  equation  is 

e-^(*)y  +  e-^(*)p(t)y  =  e-^(*)/(t) 
|(e-^(S)  =  e-^(^)/(t). 
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Integrating  both  sides  gives 

If  you  look  carefully,  you  will  see  that  this  is  exactly  the  same  solution  we  found  by  variation 
of  parameters,  because  e~^'-*-*/(t)  =  f{t)/h{t). 

Some  people  find  it  easier  to  remember  how  to  use  the  integrating  factor  method  than 
variation  of  parameters.  Since  ultimately  they  require  the  same  calculation,  you  should 
use  whichever  of  the  two  you  find  easier  to  recall.  Using  this  method,  the  solution  of  the 
previous  example  would  look  just  a  bit  different:  Starting  with  y +  3?//t  =  t^,  we  recall  that 

 „     '   =  e  "  =  t  .  Then  we  multiply  through  by  the  integrating 

factor  and  solve: 

t^y  +  t^3y/t  =  t^f 
t^y  +  t'^3y  =  t' 

t^y  =  tVe 

y  =  tV6. 

This  is  the  same  answer,  of  course,  and  the  problem  is  then  finished  just  as  before. 
Exercises  17.3. 

In  problems  1-10,  find  the  general  solution  of  the  equation. 

1.  y +  4y  =  8  ^ 

2.  y  -  2y  =  6  ^ 

3.  y  +  ty  =  bt^ 

4.  y  +  ey  =  —2e  =^ 

5.  ij  -y  =  t^  ^ 

6.  2y  +  y  =  t^ 

7.  ty-2y  =  1/i,  i  >  0  ^ 

8.  ty  +  y  =  Vi,t>0^ 

9.  y cost  +  y sint  =  1,  -7r/2  <  t  <  7r/2  ^ 
10.  y  +  ysect  =  tant,  -7r/2  <  t  <  tt/2  =^ 
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17.4  Approximation 

We  have  seen  how  to  solve  a  restricted  coUection  of  differential  equations,  or  more  ac- 
curately, how  to  attempt  to  solve  them — we  may  not  be  able  to  find  the  required  anti- 
derivatives.  Not  surprisingly,  non-linear  equations  can  be  even  more  difiicult  to  solve.  Yet 
much  is  known  about  solutions  to  some  more  general  equations. 

Suppose  (f){t,  y)  is  a  function  of  two  variables.  A  more  general  class  of  first  order 
differential  equations  has  the  form  y  =  (f){t^y).  This  is  not  necessarily  a  linear  first  order 
equation,  since  0  may  depend  on  y  in  some  complicated  way;  note  however  that  y  appears 
in  a  very  simple  form.  Under  suitable  conditions  on  the  function  it  can  be  shown  that 
every  such  differential  equation  has  a  solution,  and  moreover  that  for  each  initial  condition 
the  associated  initial  value  problem  has  exactly  one  solution.  In  practical  applications  this 
is  obviously  a  very  desirable  property. 

EXAMPLE  17.4.1  The  equation  y  —  t  —  y^  is  a  first  order  non-linear  equation,  because 
y  appears  to  the  second  power.  We  will  not  be  able  to  solve  this  equation.  □ 

EXAMPLE  17.4.2  The  equation  y  =  y'^  is  also  non- linear,  but  it  is  separable  and  can 
be  solved  by  separation  of  variables.  □ 

Not  all  differential  equations  that  are  important  in  practice  can  be  solved  exactly,  so 
techniques  have  been  developed  to  approximate  solutions.  We  describe  one  such  technique, 
Euler's  Method,  which  is  simple  though  not  particularly  useful  compared  to  some  more 
sophisticated  techniques. 

Suppose  we  wish  to  approximate  a  solution  to  the  initial  value  problem  y  =  (f){t,y), 
y{to)  =  yo,  for  t  >  to-  Under  reasonable  conditions  on  0,  we  know  the  solution  exists, 
represented  by  a  curve  in  the  t-y  plane;  call  this  solution  f{t).  The  point  (to,2/o)  is  of 
course  on  this  curve.  We  also  know  the  slope  of  the  curve  at  this  point,  namely  (p{to,  yo). 
If  we  follow  the  tangent  line  for  a  brief  distance,  we  arrive  at  a  point  that  should  be  almost 
on  the  graph  of  /(t),  namely  {to  +  At,  yo  +  '/'(^Oj  yo)At);  call  this  point  Now  we 

pretend,  in  effect,  that  this  point  really  is  on  the  graph  of  /(t),  in  which  case  wc  again 
know  the  slope  of  the  curve  through  namely  (p{ti,yi).  So  we  can  compute  a  new 

point,  (^2,2/2)  =  (^1  +  At,  +  (/){ti,yi)At)  that  is  a  little  farther  along,  still  close  to  the 
graph  of  f{t)  but  probably  not  quite  so  close  as  We  can  continue  in  this  way, 

doing  a  sequence  of  straightforward  calculations,  until  we  have  an  approximation  y^) 
for  whatever  time  tn  we  need.  At  each  step  we  do  essentially  the  same  calculation,  namely 

(ti+i,  yi+i)  =  {ti  +  At,  yi  -h  4){ti,  yi)At). 

We  expect  that  smaller  time  steps  At  will  give  better  approximations,  but  of  course  it  will 
require  more  work  to  compute  to  a  specified  time.  It  is  possible  to  compute  a  guaranteed 
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upper  bound  on  how  far  off  the  approximation  might  be,  that  is,  how  far  is  from  f{tn)- 
Suffice  it  to  say  that  the  bound  is  not  particularly  good  and  that  there  are  other  more 
complicated  approximation  techniques  that  do  better. 

EXAMPLE  17.4.3  Let  us  compute  an  approximation  to  the  solution  for  ij  =  t  —  ^ 
y{0)  =  0,  when  t  =  1.  We  will  use  At  =  0.2,  which  is  easy  to  do  even  by  hand,  though  we 
should  not  expect  the  resulting  approximation  to  be  very  good.  We  get 

(ti,  yi)  =  (0  +  0.2,  0  +  (0  -  0^)0.2)  =  (0.2,  0) 

(^2, 2/2)  =  (0.2  +  0.2,  0  +  (0.2  -  0^)0.2)  =  (0.4,  0.04) 

(t3,  ^3)  =  (0.6,  0.04  +  (0.4  -  0.04^)0.2)  =  (0.6,  0.11968) 

(t4,  y^)  =  (0.8,  0.11968  +  (0.6  -  0.11968^)0.2)  =  (0.8,  0.23681533952) 

(^5,2/5)  =  (1.0,0.23681533952  +  (0.6  -  0.23681533952^)0.2)  =  (1.0,0.385599038513605) 

So  y{l)  ~  0.3856.  As  it  turns  out,  this  is  not  accurate  to  even  one  decimal  place.  Fig- 
ure 17.4.1  shows  these  points  connected  by  line  segments  (the  lower  curve)  compared  to 
a  solution  obtained  by  a  much  better  approximation  technique.  Note  that  the  shape  is 
approximately  correct  even  though  the  end  points  are  quite  far  apart. 


y 

A 

0.5  - 


Figure  17.4.1     Approximating  a  solution  to  y  =  t  —     ,  y{0)  =  0. 


If  you  need  to  do  Euler's  method  by  hand,  it  is  useful  to  construct  a  table  to  keep 
track  of  the  work,  as  shown  in  figure  17.4.2.  Each  row  holds  the  computation  for  a  single 
step:  the  starting  point  (ti,yi);  the  stepsize  At;  the  computed  slope  (f){ti,yi);  the  change 
in  y,  Ay  =  (p{ti,yi)At;  and  the  new  point,  {ti+i,yi+i)  =  [U  +  At,yi  +  Ay).  The  starting 
point  in  each  row  is  the  newly  computed  point  from  the  end  of  the  previous  row. 

It  is  easy  to  write  a  short  function  in  Sage  to  do  Euler's  method;  see  this  Sage  work- 
sheet. □ 


466     Chapter  17  Differential  Equations 


At 

Ay  =  (/)(t,  y)  At 

(t  + At,y  + Ay) 

(0,0) 

0.2 

0 

0 

(0.2,0) 

(0.2,0) 

0.2 

0.2 

0.04 

(0.4,0.04) 

(0.4,0.04) 

0.2 

0.3984 

0.07968 

(0.6,0.11968) 

(0.6,0.11968) 

0.2 

0.58... 

0.117... 

(0.8,0.236...) 

(0.8,0.236...) 

0.2 

0.743... 

0.148... 

(1.0,0.385...) 

Figure  17.4.2     Computing  witti  Euler's  Metliod. 

Euler's  method  is  related  to  another  technique  that  can  help  in  understanding  a  dif- 
ferential equation  in  a  qualitative  way.  Euler's  method  is  based  on  the  ability  to  compute 
the  slope  of  a  solution  curve  at  any  point  in  the  plane,  simply  by  computing  (f){t,y).  If 
we  compute  (^(t,  y)  at  many  points,  say  in  a  grid,  and  plot  a  small  line  segment  with  that 
slope  at  the  point,  we  can  get  an  idea  of  how  solution  curves  must  look.  Such  a  plot  is 
called  a  slope  field.  A  slope  field  for  (p  =  t  —  y"^  is  shown  in  figure  17.4.3;  compare  this 
to  figure  17.4.1.  With  a  little  practice,  one  can  sketch  reasonably  accurate  solution  curves 
based  on  the  slope  field,  in  essence  doing  Euler's  method  visually. 
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Figure  17.4.3     A  slope  field  for  y  =  t  —  y^. 

Even  when  a  differential  equation  can  be  solved  explicitly,  the  slope  field  can  help 
in  understanding  what  the  solutions  look  like  with  various  initial  conditions.  Recall  the 
logistic  equation  from  exercise  13  in  section  17.1,  y  =  ky{M  —  y):  y  is  a  population  at  time 
t,  M  is  a  measure  of  how  large  a  population  the  environment  can  support,  and  k  measures 
the  reproduction  rate  of  the  population.  Figure  17.4.4  shows  a  slope  field  for  this  equation 
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that  is  quite  informative.  It  is  apparent  that  if  the  initial  population  is  smaller  than  M  it 
rises  to  M  over  the  long  term,  while  if  the  initial  population  is  greater  than  M  it  decreases 
to  M.  It  is  quite  easy  to  generate  slope  fields  with  Sage;  follow  the  AP  link  in  the  figure 
caption. 
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Figure  17.4.4     A  slope  field  for  y  =  0.2y(10  —  y). 


Exercises  17.4- 

In  problems  1-4,  compute  the  Euler  approximations  for  the  initial  value  problem  for  0  <  t  <  1 
and  At  =  0.2.  If  you  have  access  to  Sage,  generate  the  slope  field  first  and  attempt  to  sketch  the 
solution  curve.  Then  use  Sage  to  compute  better  approximations  with  smaller  values  of  At. 

1.  y  =  t/y,  y(0)  =  1 

2.  y  =  ^  +  y^  y(0)  =  1 

3.  y  =  cos(t  +  y),  y(0)  =  1  ^ 

4.  y  =  tlny,  y(0)  =  2  ^ 


17.5    Second  Order  Homogeneous  Equations 

A  second  order  differential  equation  is  one  containing  the  second  derivative.  These  are 
in  general  quite  complicated,  but  one  fairly  simple  type  is  useful:  the  second  order  linear 
equation  with  constant  coefficients. 

EXAMPLE  17.5.1  Consider  the  intial  value  problem  y  -  y  -  2y  =  0,  y{0)  =  5,  y{0)  =  0. 
We  make  an  inspired  guess:  might  there  be  a  solution  of  the  form  e'^*?  This  seems  at  least 
plausible,  since  in  this  case  y,     and  y  all  involve  e'^*. 


468     Chapter  17  Differential  Equations 


If  such  a  function  is  a  solution  then 

e^i(r2-r-2)  =  0 
(r^  -  r  -  2)  =  0 
(r-2)(r  +  l)  =  0, 

so  r  is  2  or  —1.  Not  only  are  /  =  e^*  and  g  =  e~*  solutions,  but  notice  that  y  =  Af  +  Bg 
is  also,  for  any  constants  A  and  B: 

{Af  +  Bg)"  -  {Af  +  S^)'  -  2{Af  +  S^)  =  Af"  +  S^;"  -  A/'  -  Bg'  -  2Af  -  2Bg 

=  A{f"  -f'~  2f)  +  B{g"  -g'-  2g) 
=  A{0)  +  B{0)  =  0. 

Can  we  find  A  and  B  so  that  this  is  a  solution  to  the  initial  value  problem?  Let's  substitute: 
5  =  y(0)  =  Af{0)  +  Bg{0)  =  Ae°  +  Be'^  =  A  +  B 

and 

0  =  y{0)  =  Af'{0)  +  Bg'{0)  =  A2e^  +  B{-l)e^  =  2A-B. 

So  we  need  to  find  A  and  B  that  make  both  b  =  A-\-  B  and  =  2A  —  B  true.  This  is  a 
simple  set  of  simultaneous  equations:  solve  B  =  2A,  substitute  to  get  5  =  A  +  2A  =  3A. 
Then  A  =  5/3  and  B  =  10/3,  and  the  desired  solution  is  (5/3)e^*  +  (10/3)e~*.  You  now 
see  why  the  initial  condition  in  this  case  included  both  y(0)  and  y{0):  we  needed  two 
equations  in  the  two  unknowns  A  and  B  □ 

You  should  of  course  wonder  whether  there  might  be  other  solutions;  the  answer  is  no. 
We  will  not  prove  this,  but  here  is  the  theorem  that  tells  us  what  we  need  to  know: 

THEOREM  17.5.2  Given  the  differential  equation  ay  +  by  +  cy  —  0,  o  7^  0,  consider 
the  quadratic  polynomial  ax^  +  6x  +  c,  called  the  characteristic  polynomial.  Using  the 
quadratic  formula,  this  polynomial  always  has  one  or  two  roots,  call  them  r  and  s.  The 
general  solution  of  the  differential  equation  is: 

(a)  y  —  Ae^^  +  Se^*,  if  the  roots  r  and  s  are  real  numbers  and  r  ^  s. 

(b)  y  =  Ae""^  +  Bte^\  if  r  =  s  is  real. 

(c)  y  =  74cos(/5t)e"*  +  5sin(/5t)e"*,  if  the  roots  r  and  s  are  complex  numbers  a  +  /3i 
and  a  —  j3i. 
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EXAMPLE  17.5.3  Suppose  a  mass  m  is  hung  on  a  spring  with  spring  constant  k.  If  the 
spring  is  compressed  or  stretched  and  then  released,  the  mass  will  oscillate  up  and  down. 
Because  of  friction,  the  oscillation  will  be  damped:  eventually  the  motion  will  cease.  The 
damping  will  depend  on  the  amount  of  friction;  for  example,  if  the  system  is  suspended  in 
oil  the  motion  will  cease  sooner  than  if  the  system  is  in  air.  Using  some  simple  physics,  it 
is  not  hard  to  see  that  the  position  of  the  mass  is  described  by  this  differential  equation: 
my  +  by  +  ky  =  0.  Using  m  =  1,  6  =  4,  and  /c  =  5  we  find  the  motion  of  the  mass.  The 
characteristic  polynomial  is  +  4x  +  5  with  roots  (—4  ±  a/16  —  20)/2  =  —2  ±  i.  Thus 
the  general  solution  is  y  =  Acos(i)e~^*  +  i?  sin(t)e~^*.  Suppose  we  know  that  y{0)  =  1 
and  y{0)  =  2.  Then  as  before  we  form  two  simultaneous  equations:  from  y{0)  =  1  we  get 
1  =  ylcos(0)e°  +  i?sin(0)e°  =  A.  For  the  second  we  compute 

y  =  -2Ae"^*  cos(t)  +  Ae~^\- sin(t))  -  2Be~^^  sin(t)  +  Se"^*  cos(t), 

and  then 

2  =  -2Ae°  cos(O)  -       sin(O)  -  2Be^  sin(O)  +  Be^  cos(O)  =  -2A  +  B. 

So  we  get  A  =  1,  B  =  4,  and  y  =  cos(t)e~^*  +  4sin(t)e~^*. 

Here  is  a  useful  trick  that  makes  this  easier  to  understand:  We  have  y  =  (cost  + 
4sint)e~^*.  The  expression  cost  +  4sint  is  a  bit  reminiscent  of  the  trigonometric  formula 
cos(q;  —  /3)  =  cos(q!)  cos(/3)  +  sin(Q!)  sin(/3)  with  a  =  t.  Let's  rewrite  it  a  bit  as 

—  /   1  4 

17  cos  t  H  =  sm  t 

l7  y/17 


Note  that  {1/Vl7f  +  {4/Vl7f  =  1,  which  means  that  there  is  an  angle  /3  with  cos  /3 
l/y/vf  and  sin/5  =  4/Vl7  (of  course,  /3  may  not  be  a  "nice"  angle).  Then 


cos  t  +  4  sin  t  =  V  17  (cos  t  cos  /?  +  sin  /3  sin  t)  =  v  17  cos(t  —  /?) . 


Thus,  the  solution  may  also  be  written  y  =  vTZe  cos(t  —  f3).  This  is  a  cosine  curve  that 
has  been  shifted  (3  to  the  right;  the  \/l7e~^^  has  the  effect  of  diminishing  the  amplitude  of 
the  cosine  as  t  increases;  see  figure  17.5.1.  The  oscillation  is  damped  very  quickly,  so  in  the 
first  graph  it  is  not  clear  that  this  is  an  oscillation.  The  second  graph  shows  a  restricted 
range  for  t.  □ 

Other  physical  systems  that  oscillate  can  also  be  described  by  such  differential  equa- 
tions. Some  electric  circuits,  for  example,  generate  oscillating  current. 


EXAMPLE  17.5.4  Find  the  solution  to  the  intial  value  problem  y  —  4y  +  4y  =  0, 
y{0)  =  —3,  y{0)  =  1.  The  characteristic  polynomial  is      —  4x  +  4  =  (x  —  2)^,  so  there 
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Figure  17.5.1     Graph  of  a  damped  oscillation. 

is  one  root,  r  =  2,  and  the  general  solution  is  Ae^*  +  Bte^^.  Substituting  t  =  0  we  get 
-3  =  A  +  0  =  A.  The  first  derivative  is  2Ae^*  +  2Ste^*  +  Se^*;  substituting  t  =  0 
gives  1  =  2A  +  0  +  B  =  2A  +  B  =  2(-3)  +  B  =  ~6  +  B,  so  B  =  7.  The  solution  is 
-3e2*  +  7te2*.  □ 

Exercises  17.5. 

1.  Verify  that  the  function  in  part  (a)  of  theorem  17.5.2  is  a  solution  to  the  differential  equation 
ay  +  by  +  cy  =  0. 

2.  Verify  that  the  function  in  part  (b)  of  theorem  17.5.2  is  a  solution  to  the  differential  equation 
ay  +  by  +  cy  =  0. 

3.  Verify  that  the  function  in  part  (c)  of  theorem  17.5.2  is  a  solution  to  the  differential  equation 
ay  +  by  +  cy  =  0. 

4.  Solve  the  initial  value  problem  y  —  co^y  =  0,  y(0)  =  1,  y(0)  =  1,  assuming  w  7^  0.  =^ 

5.  Solve  the  initial  value  problem  2y  +  18y  =  0,  y(0)  =  2,  y(0)  =  15. 

6.  Solve  the  initial  value  problem  y  +  6y  +  5y  =  0,  y(0)  =  1,  y(0)  =  0.  =J> 

7.  Solve  the  initial  value  problem  y  —  y  —  12y  =  0,  y(0)  =  0,  y(0)  =  14.  =^> 

8.  Solve  the  initial  value  problem  y  +  12y  +  36y  =  0,  y(0)  =  5,  y(0)  =  —10.  =^> 

9.  Solve  the  initial  value  problem  y  —  8y  +  16y  =  0,  y(0)  =  —3,  y(0)  =  4.  =^ 

10.  Solve  the  initial  value  problem  y  +  5y  =  0,  y(0)  =  —2,  y(0)  =  5.  =^ 

11.  Solve  the  initial  value  problem  y  +  y  =  0,  y(7r/4)  =  0,  y{n /A)  =  2.  =^ 

12.  Solve  the  initial  value  problem  y  +  12y  +  37y  =  0,  y(0)  =  4,  y(0)  =  0.  ^ 

13.  Solve  the  initial  value  problem  y  +  6y  +  18y  =  0,  y(0)  =  0,  y(0)  =  6.  =^ 

14.  Solve  the  initial  value  problem  y  +  4y  =  0,  y(0)  =  v^,  y(0)  =  2.  Put  your  answer  in  the  form 
developed  at  the  end  of  exercise  17.5.3.  =^ 

15.  Solve  the  initial  value  problem  y  +  lOOy  =  0,  y(0)  =  5,  y(0)  =  50.  Put  your  answer  in  the 
form  developed  at  the  end  of  exercise  17.5.3.  =^ 
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16.  Solve  the  initial  value  problem  y  +  4y  +  13y  =  0,  y(0)  =  1,  y(0)  =  1.  Put  your  answer  in  the 
form  developed  at  the  end  of  exercise  17.5.3. 

17.  Solve  the  initial  value  problem  y  —  8y  +  25y  =  0,  y(0)  =  3,  y(0)  =  0.  Put  your  answer  in  the 
form  developed  at  the  end  of  exercise  17.5.3.  =^ 

18.  A  mass-spring  system  my  +  by  +  kx  has  k  =  29,  6  =  4,  and  m  =  1.  At  time  t  =  Q  the  position 
is  y(0)  =  2  and  the  velocity  is  y(0)  =  1.  Find  y{t).  =^ 

19.  A  mass-spring  system  my  +  by  +  kx  has  k  =  24,  b  =  12,  and  m  =  3.  At  time  t  =  0  the 
position  is  y(0)  =  0  and  the  velocity  is  y(0)  =  —  1.  Find  y{t).  =^> 

20.  Consider  the  differential  equation  ay  +  by  =  0,  with  a  and  b  both  non-zero.  Find  the  general 
solution  by  the  method  of  this  section.  Now  let  g  =  ij;  the  equation  may  be  written  as 
ag  +  bg  =  0,  a  first  order  linear  homogeneous  equation.  Solve  this  for  g,  then  use  the 
relationship  g  =  y  to  find  y. 

21.  Suppose  that  y{t)  is  a  solution  to  ay  +  by  +  cy  =  0,  y(to)  =  0,  y(io)  =  0.  Show  that  y{t)  =  0. 

17.6    Second  Order  Linear  Equations 

Now  we  consider  second  order  equations  of  the  form  aij  +  by  +  cy  =  /(i),  with  a,  6,  and 
c  constant.  Of  course,  if  a  =  0  this  is  really  a  first  order  equation,  so  we  assume  a  ^  0. 
Also,  much  as  in  exercise  20  of  section  17.5,  if  c  =  0  we  can  solve  the  related  first  order 
equation  ah  +  bh  =  f{t),  and  then  solve  h  =  y  for  y.  So  we  will  only  examine  examples  in 
which  c  7^  0. 

Suppose  that  yi{t)  and  2/2 (^)  are  solutions  to  aij  +  by  +  cy  =  f{t),  and  consider  the 
function  h  =  yi  —  y2.  We  substitute  this  function  into  the  left  hand  side  of  the  differential 
equation  and  simplify: 

a{yi-y2)" +  b{yi-y2y +  c{yi-y2)  =  ay'{ +  by[+cyi- {ay2 +by2  +  cy2)  =  f{t)-f{t)  =  0. 

So  /i  is  a  solution  to  the  homogeneous  equation  aij  +  by  +  cy  =  0.  Since  we  know  how 
to  find  all  such  /i,  then  with  just  one  particular  solution  2/2  we  can  express  all  possible 
solutions  yi,  namely,  yi  =  h  +  y2,  where  now  h  is  the  general  solution  to  the  homogeneous 
equation.  Of  course,  this  is  exactly  how  we  approached  the  first  order  linear  equation. 

To  make  use  of  this  observation  we  need  a  method  to  find  a  single  solution  1/2 ■  This 
turns  out  to  be  somewhat  more  difficult  than  the  first  order  case,  but  if  f{t)  is  of  a  certain 
simple  form,  we  can  find  a  solution  using  the  method  of  undetermined  coefficients, 
sometimes  more  whimsically  called  the  method  of  judicious  guessing. 

EXAMPLE  17.6.1  Solve  the  differential  equation  ij  -  y  -  6y  =  18t^  +  5.  The  general 
solution  of  the  homogeneous  equation  is  Ae^^  +  Be~^^ .  We  guess  that  a  solution  to  the 
non-homogeneous  equation  might  look  like  f{t)  itself,  namely,  a  quadratic  y  =  at^  -\-bt-{-c. 
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Substituting  this  guess  into  the  differential  equation  we  get 

y-y-6y  =  2a-  {2at  +  6)  -  6(at^  +bt  +  c)  =  -6at'^  +  (-2a  -  6b)t  +  {2a  -  b  -  6c). 
We  want  this  to  equal  18t^  +  5,  so  we  need 

-6a  =  18 
-2a  -  66  =  0 
2a  —  b  —  6c  =  5 

This  is  a  system  of  three  equations  in  three  unknowns  and  is  not  hard  to  solve:  a  ~  —3, 
b  =  1,  c  —  —2.  Thus  the  general  solution  to  the  differential  equation  is  Ae^^  +  Be~^*  — 
3t'^  +t-2.  □ 

So  the  "judicious  guess"  is  a  function  with  the  same  form  as  f{t)  but  with  undetermined 
(or  better,  yet  to  be  determined)  coefficients.  This  works  whenever  f{t)  is  a  polynomial. 

EXAMPLE  17.6.2  Consider  the  initial  value  problem  my  +  ky  =  —mg,  y{0)  =  2, 
y{0)  —  50.  The  left  hand  side  represents  a  mass-spring  system  with  no  damping,  i.e., 
b  —  0.  Unlike  the  homogeneous  case,  we  now  consider  the  force  due  to  gravity,  —mg, 
assuming  the  spring  is  vertical  at  the  surface  of  the  earth,  so  that  g  =  980.  To  be  specific, 
let  us  take  m  =  1  and  k  =  100.  The  general  solution  to  the  homogeneous  equation 
is  ^  cos  (lot)  +  5  sin  (lot).  For  the  solution  to  the  non-homogeneous  equation  we  guess 
simply  a  constant  y  =  a,  since  —mg  =  —980  is  a  constant.  Then  y  +  lOOj/  =  100a  so 
a  =  -980/100  =  -9.8.  The  desired  general  solution  is  then  ^cos(lOt)  +  S  sin(lOt)  -  9.8. 
Substituting  the  initial  conditions  we  get 

2  =  A- 9.8 
50  =  105 

so  ^  =  11.8  and  5  =  5  and  the  solution  is  11.8cos(10t)  +  5sin(10t)  -  9.8.  □ 

More  generally,  this  method  can  be  used  when  a  function  similar  to  f{t)  has  derivatives 
that  are  also  similar  to  /(t);  in  the  examples  so  far,  since  f{t)  was  a  polynomial,  so  were 
its  derivatives.  The  method  will  work  if  f{t)  has  the  form  p{t)e°'^  cos(/3t)  +  q{t)e°'^  sin(/3t), 
where  p{t)  and  q{t)  are  polynomials;  when  a  =  /3  =  0  this  is  simply  p(t),  a  polynomial. 
In  the  most  general  form  it  is  not  simple  to  describe  the  appropriate  judicious  guess;  we 
content  ourselves  with  some  examples  to  illustrate  the  process. 

EXAMPLE  17.6.3  Find  the  general  solution  to  y  +  7y  +  lOy  =  e^*.  The  characteristic 
equation  is     +  7r  +  10  =  (r  +  5)(r  +  2),  so  the  solution  to  the  homogeneous  equation  is 
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Ae~^^  +  5e~^*.  For  a  particular  solution  to  the  inhomogeneous  equation  we  guess  Ce^*. 
Substituting  we  get 

When  C  =  1/40  this  is  equal  to  f{t)  =       so  the  solution  is  Ae-^^+Be-'^*+{l/m)e^*.  □ 

EXAMPLE  17.6.4  Find  the  general  solution  to  y  +  7y  +  lOy  =  e~^*.  Following  the  last 
example  we  might  guess  Ce~^*,  but  since  this  is  a  solution  to  the  homogeneous  equation 
it  cannot  work.  Instead  we  guess  Cte~^*.  Then 

(-2Ce-^*  -  2Ce-2*  +  ACte-^^)  +  7{Ce-^*  -  2Cte-^^)  +  lOCte-^*  =  e-^\-3C). 

Then  C  =  -1/3  and  the  solution  is  Ae'^^  +  Be'^^  -  {l/3)te-^K  □ 

In  general,  if  f{t)  =  e^*  and  k  is  one  of  the  roots  of  the  characteristic  equation,  then 
we  guess  Cte'^^  instead  of  Ce^*.  If  k  is  the  only  root  of  the  characteristic  equation,  then 
Cte''*  will  not  work,  and  we  must  guess  Ct^e'^^. 

EXAMPLE  17.6.5  Find  the  general  solution  to  y  —  6y  +  9y  =  e^*.  The  characteristic 
equation  is  —  6r  +  9  =  (r  —  3)^,  so  the  general  solution  to  the  homogeneous  equation  is 
Ae^^  +  Bte^*.  Guessing  Ct^e^^  for  the  particular  solution,  we  get 

The  solution  is  thus  Ae^*  +  Bte^^  +  (l/2)t^e^*.  □ 

It  is  common  in  various  physical  systems  to  encounter  an  f(t)  of  the  form  acosiut)  + 
6sin(ci;t). 

EXAMPLE  17.6.6  Find  the  general  solution  to  y  +  6?)  +  25y  =  cos(4t).  The  roots 
of  the  characteristic  equation  are  —  3  ±  4z,  so  the  solution  to  the  homogeneous  equation 
is  e~^*(^cos(4t)  +  Ssin(4t)).  For  a  particular  solution,  we  guess  Ccos(4t)  +  Dsin(4t). 
Substituting  as  usual: 

(-16Ccos(4t)  +  -16r>  sin(4t))  +  6(-4Csin(4t)  +4r>cos(4t))  +25(Ccos(4t)  +  Dsin(4t)) 
=  (24L»  +  9C)  cos(4t)  +  (-24C  +  9L»)  sin(4t). 

To  make  this  equal  to  cos(4t)  we  need 

2AD  +  9C  =  1 
9L»  -  24C  =  0 

which  gives  C  =  1/73  and  D  =  8/219.  The  fuU  solution  is  then  e~^*(^cos(4t)+S  sin(4t))  + 
(1/73)  cos(4t)  +  (8/219)  sin(4t). 
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The  function  e~  (Acos(4t)  +  Bsm{4t))  is  a  damped  oscillation  as  in  example  17.5.3, 
while  (1/73)  cos(4t)  +  (8/219)  sin(4t)  is  a  simple  undamped  oscillation.  As  t  increases,  the 
sum  e~^^{Acos{At)  +  i?sin(4t))  approaches  zero,  so  the  solution 

e"^*(Acos(4t)  +  Ssin(4t))  +  (1/73)  cos(4t)  +  (8/219)  sin(4t) 

becomes  more  and  more  like  the  simple  oscillation  (1/73)  cos(4t)  +  (8/219)  sin(4t) — notice 
that  the  initial  conditions  don't  matter  to  this  long  term  behavior.  The  damped  portion 
is  called  the  transient  part  of  the  solution,  and  the  simple  oscillation  is  called  the  steady 
state  part  of  the  solution.  A  physical  example  is  a  mass-spring  system.  If  the  only  force 
on  the  mass  is  due  to  the  spring,  then  the  behavior  of  the  system  is  a  damped  oscillation. 
If  in  addition  an  external  force  is  applied  to  the  mass,  and  if  the  force  varies  according  to 
a  function  of  the  form  acos(a;t)  +  bsin^ut),  then  the  long  term  behavior  will  be  a  simple 
oscillation  determined  by  the  steady  state  part  of  the  general  solution;  the  initial  position 
of  the  mass  will  not  matter.  □ 

As  with  the  exponential  form,  such  a  simple  guess  may  not  work. 

EXAMPLE  17.6.7  Find  the  general  solution  to  y  +  16y  =  -sin(4t).  The  roots 
of  the  characteristic  equation  are  ±4i,  so  the  solution  to  the  homogeneous  equation  is 
Acos{4t)  +  Bsm{4t).  Since  both  cos(4t)  and  sin(4i)  are  solutions  to  the  homogeneous 
equation,  Ccos(4t)  +  Dsm{4t)  is  also,  so  it  cannot  be  a  solution  to  the  non-homogeneous 
equation.  Instead,  we  guess  Ct  cos(4t)  -|-  Dtsm{4t).  Then  substituting: 

(-16Ctcos(4t)  -  16L>sin(4t)  +  8L>cos(4t)  -  8Csin(4t)))  +  16(Ctcos(4t)  +  Dtsm{4t)) 
=  8Dcos{4t)  -8Csm(4t). 

Thus  C  =  1/8,  D  =  0,  and  the  solution  is  Ccos(4t)  +Dsm{4t)  +  (l/8)t cos(4t).  □ 

In  general,  if  f{t)  =  acos{ojt)  +  6sin(a;t),  and  ±uji  are  the  roots  of  the  characteristic 
equation,  then  instead  of  Ccos(a;t)  +Dsm{ujt)  we  guess  Ctcos{ujt)  +  Dtsm(ujt). 

Exercises  17.6. 

Find  the  general  solution  to  the  differential  equation. 

1.  y  —  lOy  +  25y  =  cost 

2.  y  +  2V2y  +  2y  =  10  ^ 

3.  y  +  16y  =  sf  +  3t  -  4  ^ 

4.  y  +  2y  =  cos(5t)  +  sin(5t)  =^ 

5.  y  -  2y  +  2y  =  e'*  ^ 

6.  y  -  6y  +  13  =  1  +  2t  +  e"*  ^ 
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7.  y  +  y  -  6y  =  =^ 

8.  y  -  4y  +  3y  =  e'* 

9.  y  +  16y  =  cos(4t)  =^ 

10.  y +  9y  =  3sin(3t)  ^ 

11.  y  +  12y  +  36y  =  6e"*^* 

12.  y  -  8y  +  16y  =  -2e'^* 

13.  y +  6y  +  5y  =  4  ^ 

14.  y  -  ij  -I2y  =  t  ^ 

15.  y  +  5y  =  8sin(2t)  ^ 

16.  y  -  4y  =  4e^*  ^ 

Solve  the  initial  value  problem. 

17.  y  -  y  =  3t  +  5,  y(0)  =  0,  y(0)  =  0  ^ 

18.  y  +  9y  =  4t,  y(0)  =  0,  y(0)  =  0  ^ 

19.  y  +  12y  +  37y  =  106^^*,  y(0)  =  4,  y(0)  =  0  ^ 

20.  y  +  6y  +  18y  =  cost  -  sint,  y(0)  =  0,  y(0)  =  2  ^ 

21.  Find  the  solution  for  the  mass-spring  equation  y  +  4y  +  29y  =  689cos(2t).  =^ 

22.  Find  the  solution  for  the  mass-spring  equation  3y  +  12y  +  24y  =  2  sint.  =^ 

23.  Consider  the  differential  equation  my  +  by  +  ky  =  cos{ujt),  with  m,  b,  and  k  all  positive  and 
b^  <  2mk;  this  equation  is  a  model  for  a  damped  mass-spring  system  with  external  driving 
force  cos{u]t).  Show  that  the  steady  state  part  of  the  solution  has  amplitude 

1 

y/Amk  —  26^ 

Show  that  this  amplitude  is  largest  when  lo  =   .  This  is  the  resonant  frequency 

2m 

of  the  system. 
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The  method  of  the  last  section  works  only  when  the  function  f(t)  in  ay  +  by  +  cy  =  f{t) 
has  a  particularly  nice  form,  namely,  when  the  derivatives  of  /  look  much  like  /  itself.  In 
other  cases  we  can  try  variation  of  parameters  as  we  did  in  the  first  order  case. 

Since  as  before  a  7^  0,  we  can  always  divide  by  a  to  make  the  coefficient  of  y  equal 
to  1.  Thus,  to  simplify  the  discussion,  we  assume  a  =  1.  We  know  that  the  differential 
equation  y  +  by  +  cy  =  0  has  a  general  solution  Ayi  +  By2.  As  before,  we  guess  a  particular 
solution  to  y  +  by  +  cy  =  f{t);  this  time  we  use  the  guess  y  =  u{t)yi  +  v{t)y2-  Compute 
the  derivatives: 

y  =  uyi+  uyi  +  vy2  +  vy2 

y  =  uyi+  iiiji  +  iiyi  +  uyi  -f  vy2  +  vy2  +  vy2  +  vy2. 
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Now  substituting: 

y  +  by  +  cy^uyi  +  iiiji  +  iiiji  +  uyi  +  vy2  +  vy2  +  vy2  +  vy2 

+  fowyi  +  fettyi  +  hvy2  +  &f  2/2  +  cuyi  +  ci;?/2 
=  (tti/i  +  buy  I  +  cttyi)  +  {vy2  +  6v?/2  +  cvy2) 

+  h{uyi  +  t;y2)  +  {uyi  +       +  i;?/2  +  vm)  +        +  vm) 
=  0  +  0  +  h{uyi  +  7)2/2)  +  {uyi  +  wyi  +  vy2  +  7)^2)  +  (wyi  +  vm)- 

The  first  two  terms  in  parentheses  are  zero  because  yi  and  2/2  are  solutions  to  the  associated 
homogeneous  equation.  Now  we  engage  in  some  wishful  thinking.  If  iiyi  +  vy2  =  0  then 
also  ilyi  +  iiyi  +  vy2  +  vy2  =  0,  by  taking  derivatives  of  both  sides.  This  reduces  the  entire 
expression  to  iiyi  +  vy2-  We  want  this  to  be  fit),  that  is,  we  need  iiiji  +  vy2  —  f{t).  So 
we  would  very  much  like  these  equations  to  be  true: 

uyi  +  vy2  =  0 

uyi  +  vm  =  /(*)• 

This  is  a  system  of  two  equations  in  the  two  unknowns  u  and  i),  so  we  can  solve  as  usual  to 
get  ii  —  g{t)  and  v  =  h{t).  Then  we  can  find  u  and  v  by  computing  antiderivatives.  This  is 
of  course  the  sticking  point  in  the  whole  plan,  since  the  antiderivatives  may  be  impossible 
to  find.  Nevertheless,  this  sometimes  works  out  and  is  worth  a  try. 

EXAMPLE  17.7.1  Consider  the  equation  y  —  5y  +  6y  =  sin  t.  We  can  solve  this  by  the 
method  of  undetermined  coefficients,  but  we  will  use  variation  of  parameters.  The  solution 
to  the  homogeneous  equation  is  Ae^^  +  Se^*,  so  the  simultaneous  equations  to  be  solved 
are 

we^*  +  ve^^  =  0 
2?ie^*  +  3i;e^^  =  sint. 

If  we  multiply  the  first  equation  by  2  and  subtract  it  from  the  second  equation  we  get 

ve^^  =  sint 
ij  =  e~^*  sint 

V  =  —  ^(3sint  +  cost)e~^*, 

using  integration  by  parts.  Then  from  the  first  equation: 

ii  =  -e-^He^*  =  -e~^*e~^*sin(t)e^*  =  -e~^*sint 
u=  ^  (2  sin  t  +  cos  t)e~^*. 
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Now  the  particular  solution  we  seek  is 

we^*  +  ve^^  =  ^  (2  sin t  +  cos  t)e"^*e^*  -      (3  sin t  +  cos  t)e~^*e^* 
5  10 

=  -  (2  sin  t  +  cos  t)  —  —{3  sin  t  +  cos  t) 
5  10 

1  .  ■ 

=  —  (smt  +  cost), 

and  the  solution  to  the  differential  equation  is  Ae^*  +  Be^*  +  (sint  +  cost)/10.  For  com- 
parison (and  practice)  you  might  want  to  solve  this  using  the  method  of  undetermined 
coefficients.  □ 

EXAMPLE  17.7.2  The  differential  equation  y  —  by  +  6y  =  sint  can  be  solved  using 
the  method  of  undetermined  coefficients,  though  we  have  not  seen  any  examples  of  such  a 
solution.  Again,  we  will  solve  it  by  variation  of  parameters.  The  equations  to  be  solved 
are 

•ue^*  +  we^*  =  0 
2ue^*  +  3we^*  =  e*sint. 
If  we  multiply  the  first  equation  by  2  and  subtract  it  from  the  second  equation  we  get 

ve^^  —  e*  sint 

V  =  e~^*e*  sint  =  e~^*  sint 

V  =  -  ^  (2  sin  t  +  cos  t)e~^*. 

5 

Then  substituting  we  get 

u  =  -e~^^ve^^  =  -e~^*e~^*sin(t)e^*  =  -e~*sint 

1 /  .  ^  -f 

u  = -{smt  +  cost)e  . 

The  particular  solution  is 

1  1 

■ue^*  +  ve"^  =  -  (sin  t  +  cos  t)e"*e^*  -  -  (2  sin  t  +  cos  t)e"^*e^* 

2  5 

=  ^  (sin  t  +  cos  t)e*  —  -  (2  sin  t  +  cos  t)e* 
2  5 

=  ^(sint  +  3cost)e*, 

and  the  solution  to  the  differential  equation  is  Ae^*  +  Be^*  +  e*(sint  +  3cost)/10.  □ 
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EXAMPLE  17.7.3  The  differential  equation  y  -  2y  +  y  =  /t^  is  not  of  the  form 
amenable  to  the  method  of  undetermined  coefficients.  The  solution  to  the  homogeneous 
equation  is  Ae*  +  Bte^  and  so  the  simultaneous  equations  are 


Subtracting  the  equations  gives 


Then  substituting  we  get 


we*  +  vte^  =  0 
e* 


e* 

ue*  +  wte*  +  i;e*  — 


.  t  e* 

ve=-^ 

1 

1 

V  =  — . 

t 


■  f     .  f      1  t 

ue  =  -vte  =  -^te 
1 

ii  —  — 
t 

u  =  —  Int. 


The  solution  is  Ae*  +  Ste*  -  e*  Int  -  e*. 


□ 


Exercises  17.7. 

Find  the  general  solution  to  the  differential  equation  using  variation  of  parameters. 

1.  y  +  y  =  tana;  =^ 

2.  y  +  y  =  e  =^ 

3.  y  +  4y  =  sec  x  =^ 

4.  y  +  4y  =  tan  x  =^> 

5.  y  +  y  -  6y  =  t^e^*  =^ 

6.  y-2y  +  2y  =  e  tan(t)  ^ 

7.  y  —  2y  +  2y  =  sin(t)  cos(t)  (This  is  rather  messy  when  done  by  variation  of  parameters; 
compare  to  undetermined  coefficients.)  =^ 


A 

Selected  Answers 


1.1.1.  (2/3)a;+ (1/3) 

1.1.2.  y  =  -2x 

1.1.3.  (-2/3)x  + (1/3) 

1.1.4.  y  =  2a;  +  2,  2,  -1 

1.1.5.  y  =  —X  +  6,  6,  6 

1.1.6.  y  =  a;/2  +  l/2,  1/2,  -1 

1.1.7.  y  =  3/2,  y-intercept:  3/2,  no 
a;-intercept 

1.1.8.  y  =  {-2/3)x  -  2,  -2,  -3 

1.1.9.  yes 

1.1.10.  y  =  0,y=  -2x  +  2,  y  =  2x  +  2 

1.1.11.  y  =  75t,  164  minutes 

1.1.12.  y={9/5)x  +  32,  (-40,-40) 

1.1.13.  y  =  0.15a; +  10 

1.1.14.  0.03X  +  1.2 

1.1.15.  (a)    y  = 

0  0  <  x  <  100 

{x/10)  -  10  100  <  X  <  1000 

X  -  910  1000  <  X 


1.1.16.  y 

0.15a;  0  <  a;  <  19450 

0.28a;  -  2528.50  19450  <  a;  <  47050 
0.33a;  -  4881        47050  <x<  97620 

1.1.17.  (a)  P  = -0.0001a;  +  2 

(b)  a;=  -lOOOOP  +  20000 
(2/25)a;  -  (16/5) 


1.1.18 
1.2.1 


(a)  a;^  +     =  9 


(b)  (a;  -  5)2  +  (y-ef  =  9 

(c)  (a;  +  5)2  +  (y  +  6)2  =  9 
1.2.2.  (a)  Aa;  =  2,  Ay  =  3,  m  =  3/2, 

y  =  (3/2)a;-3,  ^13 

(b)  Aa;  =  —1,  Ay  =  3,  m  =  —3, 
y  =  -3a;  +  2,  ^10 

(c)  Aa;  =  -2,  Ay  =  -2,  m  =  1, 
y  =  X,  VS 

(x  +  2/7)2  +  (y  -  41/7)2  =  1300/49 


1.2.6 
1.3.1 
1.3.2 
1.3.3 
1.3.4 
1.3.5 


{a; 
{a; 
{a; 
{a; 

{x 


X  >  3/2} 
a;^-l} 

X  ^  1  and  X  ^  —1} 
a;  <  0} 

X  e  Ml,  i.e.,  all  x 
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1    Q  <? 
l.O.D. 

fry 

1.0.7. 

fry. 

{x 

1.0.0. 

{x 

ion 

1.0. y. 

{x 

1.0. lU. 

{x 

1.0.11. 

{x 

1.3.12. 

R 

1.3.13. 

{x 

1.3.14. 

A 

1.3.15. 

V 

x>0} 

h  —  r<x<h  +  r} 
X  >  1  or  X  <  0} 
-1/3  <x<  1/3} 
a;  >  0  and  x  ^  1} 
a;  >  0  and  x  ^  1} 


r(50  —  7rr^),  {r  \  0  <  r  < 

1.3.16.  A  =  2nr^  +  2000/r,  {r  |  0  <  r  <  00} 

2.1.1.  -5,  -2.47106145,  -2.4067927, 
-2.400676,  -2.4 

2.1.2.  -4/3,  -24/7,  7/24,  3/4 

2.1.3.  -0.107526881,  -0.11074197, 

-0.1110741,    ,  ,  ^  — 

'  3(3  + Ax)  9 

^  ^  ^    3  +  3Ax  +  Aa;2 

2.1.4.  >  3 

1  +  Ax 

2.1.5.  3.31,  3.003001,  3.0000, 
3  +  3Ax  +  Ax^  3 

2.1.6.  m 

2.2.1.  10,  25/2,  20,  15,  25,  35. 

2.2.2.  5,  4.1,  4.01,  4.001,  4  +  At  4 

2.2.3.  -10.29,  -9.849,  -9.8049, 
-9.8-4.9At^^  -9.8 

2.3.1.  7 

2.3.2.  5 

2.3.3.  0 

2.3.4.  undefined 

2.3.5.  1/6 

2.3.6.  0 


2.3.7. 

2.3.8. 

2.3.9. 
2.3.10. 
2.3.11. 
2.3.12. 
2.3.13. 
2.3.14. 
2.3.15. 
2.3.16. 
2.3.18. 


2.4.1. 

2.4.2. 

2.4.3. 

2.4.4. 

2.4.5. 

2.4.8. 

2.4.9. 
2.4.10. 
2.4.11. 

3.1.1. 

3.1.2. 

3.1.3. 

3.1.4. 

3.1.5. 

3.1.6. 

3.2.1. 

3  •  2  •  2  • 

3.2.3. 
3.2.4. 


3 

172 
0 

2 

does  not  exist 

V2 

3a^ 

512 

-4 

0 

(a)  8,  (b)  6,  (c)  dne,  (d)  -2,  (e)  -1, 
(f)  8,  (g)  7,  (h)  6,  (i)  3,  (j)  -3/2, 
(k)  6,  (1)  2 


-X/  V  169  — 
-9.8t 
2x  +  l/x^ 
2ax  +  b 
3x^ 

-2/(2a;+ 1)3/2 

5/(t  +  2)2 

y  =  -13x  +  17 

-8 

100x^9 

-lOOx-'^' 

-5x-^ 

TTX""-^ 

(3/4)x-V^ 

-(9/7)x-i6/^ 
15a;2  +  24a; 
-20x^  +  6x  +  10/x^ 
-30x  +  25 
6x^  +  2a;  -  8 
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3.2.5. 
3.2.6. 
3.2.7. 
3.2.S. 
3.2.9. 


3x^  +  6x-l 

y  =  13x/4  +  5 
y  =  24x-48-  tt^ 
-49t/5  +  5,  -49/5 


3.2.11.  ^kakx''-^ 


3.2.12. 
3.3.1. 
3*  3  •  2  • 

3a3*3* 

3.3.4. 
3*3*3  • 
3.4.1. 

3.4.2. 


fc=i 

„3 


a:716  -  3a:/4  +  4 

3x^{x^  -5x  +  10)  +  x^{3x^  -  5) 

{x^  +  5x-3)  {5x^  -  18x^  +  6x-7)  + 
{2x  +  5){x^  -  6x^  +  3x'^  -7x  +  l) 

-\/625  —  Xy/x 


2v^ 
-1 


^625" 


x^ 


20V625  -  x^ 


,19 


x^'^VQ^b  -  x^  X' 
f  =  4{2x-3),y  =  4x-7 
3x^  x^{3x^  -  5) 


„21 


3.4.3. 

3.4.4. 

3.4.5. 
3.4.6. 
3.4.8. 
3.4.9. 
3.5.1. 
3*3*2* 
3*3*3* 


x^-5x+  10     (a;3  -  5x  +  10)^ 

2x  +  5 
—  6x^  +  3x^  —  7x  +  1 
{x^  +  5x-  3)  (5x4  _  ig^2  +  Qx-7) 

{x^  —  6x^  +  3x'^  —  7x  +  1)^ 

1  x^/^ 
+ 


2V^V625  -  x^      (625  -  ^2)3/2 
-1  20V625  -  a;2 


a;i9V625  -  x^ 
y=\7x/A-A\/A 

y=  lla;/16-  15/16 

y  =  19/169  -  5a;/338 

13/18 

4x^  -9x'^  +X  +  7 
3x'^  -  4x  +  2/^ 
6{x^  +  ifx 


X 


21 


3.5.4. 
3.5.5. 

3.5.6. 
3.5.7. 

3.5.S. 

3.5.9. 

3.5.10. 

3.5.11. 
3.5.12. 

3.5.13. 

3.5.14. 

3.5.15. 


^/l69  -  x^  -  x^l^/^m^^ 

(2a;  -  4)^25 -a;2- 

{x^  -  4a;  +  5)a;/V25  -  x'^ 


—xlx/  — 


2a;Vvl  +  a;4 
1 

4v^(5  -  V^)3/2 
6  +  ISx 


x 


x  +  1 


2a;  +  1  ^ 
1  —  X        (1  —  a:)2 

-l/V25-a;2  -  V25-a;2/a;2 

-169 


1 


1 


169 


X 


3,x2  -  2x  +  1  /a:2 
2^Jx^  -x"^  -  {1/x) 
300a; 


(100-a;2)5/2 

1  +  3x2 
3(a;  +  a;3)2/3 


3.5.16.     4x(x2  +  1)  + 


4a;^  +  4a; 


2^1  +  (a;2  +  1)^ 


3.5.17. 
3.5.18. 
3.5.19. 
3.5.20. 
3.5.21. 
3.5.22. 
3*3*23* 
3.5.24. 
3*3*23* 
3.5.26. 
3.5.27. 


2^(x2  +  1)2  +  ^1  +  (a;2  +  1)2 

5(a;  +  8)4 
-3(4  -  xf 
6a;(a;^  +  5)^ 
-12a;(6  -  2x'^f 
24a;^(l  -4a;3)-^ 
5  +  5/x^ 

-8(4x-  l)(2x2  -a;  +  3)-2 
l/(x  +  l)2 

3(8x-2)/(4x2  -2a;  +  l)2 
— 3x^  +  5x  —  1 

6a;(2x  -  4)^  +  6(3a;2  +  l)(2a;  -  4)^ 
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3.5.28.  -2/{x-lf 

3.5.29.  Ax/{x^  +  lf 

3.5.30.  (a;^-6a;  +  7)/(a;-3)^ 

3.5.31.  -5/(3a;-4)2 

3.5.32.  eOx^  +  72x^  +  ISx^  +  ISx  -  6 

3.5.33.  {h-Ax)/{{2x  +  lf{x-2,f) 

3.5.34.  1/(2(2 +  3x)2) 

3.5.35.  hQx^  +  72x^  +  llOx^  +  lOOx^  + 

eOx^  +  28x  +  6 

3.5.36.  y  =  23a;/96  -  29/96 

3.5.37.  y  =  3  -  2a;/3 

3.5.38.  y  =  lSx/2  -  23/2 

3.5.39.  y  =  2x-ll 


3.5.40.  y  = 


20  +  2v^ 


X  + 


3V5 


5a/4+ 5a/4  + 

4.1.1.  2?T,7r  —  7r/2,  any  integer  n 

4.1.2.  mr  ±  7r/6,  any  integer  n 

4.1.3.  {V2  +  VQ)/A 

4.1.4.  -(1  +  V3)/{1  -  \/3)  =  2  +  V3 
4.1.11.  t  =  n/2 

4.3.1.  5 

4.3.2.  7/2 

4.3.3.  3/4 

4.3.4.  1 

4.3.5.  -V2/2 

4.3.6.  7 

4.3.7.  2 

4.4.1.  sin(A/a:)  cos(v^)/\/x 
sinx 


4.4.2. 
4.4.3. 
4.4.4. 


2v^ 
cos  a; 

sin^  X 


+  ^/x  COS  a; 


(2a;  +  1)  sin  a;  —  (a;^  +  a;)  cos  x 


4.4.5. 

4.5.1. 
4.5.2. 

4.5.3. 

4.5.4. 

4.5.5. 

4.5.6. 

4.5.7. 

4.5.8. 

4.5.9. 
4.5.10. 
4.5.11. 
4.5.12. 

4.5.13. 
4.5.14. 
4.5.15. 
4.5.16. 
4.5.17. 
4.5.18. 

4.7.1. 

4.7.2. 

4.7.3. 
4.7.4. 
4.7.5. 


sm  X  cos  X 


sin^  X 


cos^  X  —  sin^  X 


—  sin  a:  cos  (cos  a;) 

tan  x  +  X  sec^  x 


2^  tanx 
sec^  x{l  +  sin  x)  —  tan  x  cos  a; 


(1  +  sina;)^ 


csc^  X 


—  CSC  X  cot  a; 

3a;^  sin(23a;^)  +  46a;^  cos(23a;^) 
0 

—6  cos(cos(6a:))  sin(6a;) 

sin6'/(cos6'  +  1)2 

cos(6t)  -  sin(6t) 

3t2(sin(3t)  +  tcos(3t))/cos(2t)  + 
2t^  sin(3t)  sin(2t) /  cos^  (2t) 

n7r/2,  any  integer  n 

7r/2  +  nTT,  any  integer  n 

y3x/2  +  3/4  -  y37r/6 

8V3a;  +  4-8V37r/3 

3V3a;/2  -  v^7r/4 

7r/6  +  2n7r,  Stt/G  +  2n7r,  any  integer 
n 

21n(3)a:3^' 
cos  X  —  sin  a; 


2e' 

cos(e^) 
cos(a;)e®'"' 


sin^  X 


4.7.6.  a;"^"^    cos  a;  In  a;  + 


sma; 

X 
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4.7.7. 

4.7.8. 

4.7.9. 
4.7.10. 
4.7.11. 
4.7.12. 
4.7.13. 
4.7.14. 
4.7.15. 
4.7.20. 

4.8.1. 

4.8.2. 

4.8.3. 

4.8.4. 

4.8.5. 

4.8.6. 

4.8.7. 

4.8.8. 

4.8.9. 
4.8.11. 
4.8.12. 
4.8.13. 

4.8.14. 
4.8.15. 
4.8.16. 

4.9.3. 
4.9.5. 
4.9.6. 


1  +  2^ ln(2) 

-2x  lii(3)(l/3)^' 
e^^(4x-l)/a;2 
{Zx'^  +  2,)/{x^  +  2,x) 
—  tan(a;) 

(l-ln(a;2))/(a;2v^^) 
sec{x) 

a;°°^^^^(cos(x)/a:  -  cos(a:)  ln(a;)) 
e 

x/y 

-i2x  +  y)/ix  +  2y) 

{2xy  -  3x^  -  y^) /{2xy  -  3y'^  -  x^) 

sin(a;)  sin (j/)/ (cos (a;)  cos(y)) 

{y  sec^ (x/y)  -y^)/{x  sec^ {x/y) + ) 
{y  -  cos{x  +  y))/(cos(a;  -\- y)  -  x) 

1  I  2 

-y  /x 
1 

y  =  2a;  ±  6 
y  =  x/2±3 

(^3,2^),  (-V3,-2v^), 
(2^3,^3),  (-2^3,-^3) 

y  =  lx/^/^-%/^/2, 

,       1/3     ,     1/3       ,     1/3     X  ,  1/3 

y  =  \-yi   x  +  y^'  xi+x{  yi)/x{ 

{y  -  yi)/ix  -  xi)  =  {2x1  +  '^xiyl  - 
xi)/{2yf  +  2y^xl+yi) 

-1 
1  +  x^ 
2x 


4.9.7. 
4.9.8. 


-3a;^  cos(x^)/-y^l  -sin^(a;3^ 


(arcsin  x)\/i  —  x'^ 

4.9.9.  -e^/Vl  -  e^^ 
4.9.10.  0 

(1  +  lnx)x^ 


4.9.11. 


1  +  e^a; 


In  5(1  +  x'^^)  arctan(a;^) 

4.10.1.  0 

4.10.2.  oo 

4.10.3.  1 

4.10.4.  0 

4.10.5.  0 

4.10.6.  1 

4.10.7.  1/6 

4.10.8.  -oo 

4.10.9.  1/16 

4.10.10.  1/3 

4.10.11.  0 

4.10.12.  3/2 

4.10.13.  -1/4 

4.10.14.  -3 

4.10.15.  1/2 

4.10.16.  0 

4.10.17.  -1 

4.10.18.  -1/2 

4.10.19.  5 

4.10.20.  oo 

4.10.21.  oo 

4.10.22.  2/7 

4.10.23.  2 

4.10.24.  -oo 

4.10.25.  1 
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4.10.26. 

1 

5.1.8. 

min  at  a;  =  7tt/12  +  kir,  max  at 

4.10.27. 

2 

X  =  — 7r/12  +  A;7r,  for  integer  A;. 

4.10.28. 

1 

c;  1  o 

none 

4.10.29. 

0 

CI  1  1  n 

O.l.lU. 

local  max  at  a;  =  5 

4.10.30. 

1/2 

0. 1. 1 1. 

local  min  at  x  =  49 

4.10.31. 

2 

D.  1.  IZ. 

local  min  at  a;  =  0 

4.10.32. 

0 

o.l.lo. 

one 

4.10.33. 

oo 

5.2.1. 

min  at  a;  =  1/2 

4.10.34. 

1/2 

5.2.2. 

min  at  x  =  —1,  max  at  a;  =  1 

4.10.35. 

0 

5.2.O. 

max  at  a;  =  2,  min  at  a;  =  4 

4.10.36. 

1/2 

5.2.4. 

min  at  a;  =  ±1,  max  at  a;  =  0. 

4.10.37. 

5 

5.2.5. 

min  at  a;  =  1 

4.10.38. 

2^2 

5.2.6. 

none 

4.10.39. 

-1/2 

5.2.7. 

none 

4.10.40. 

2 

5.2.8. 

min  at  x  =  77r/12  +  /ctt,  max  at 

4.10.41. 

0 

Jb  —  — TT  /  IZ       AvTT,  iUi  iiilc^ci  Av. 

4.10.42. 

oo 

CI  o  o 

none 

4.10.43. 

0 

o.z.lU. 

max  at  X  =  0,  min  at  x  =  ±11 

4.10.44. 

3/2 

1^  9  1 1 

mm  all  X  —  — oj  z,  neiinei  cii  x  —  u 

4.10.45. 

oo 

CI  O  1  Q 

min  at  tt-tt,  max  at  7r/2  +  titt 

4.10.46. 

5 

5.2.14. 

min  at  2?i7r,  max  at  (2?i  +  l)7r 

4.10.47. 

-1/2 

K  o  1  c; 
5.Z.15. 

min  at  7r/2+2n7r,  max  at  37r/2+2n7r 

4.10.48. 

does  not  exist 

5.O.I. 

min  at  x  =  1  /2 

4.10.49. 

oo 

CJ  o  o 

5.O.2. 

min  at  a;  =  —  1,  max  at  a;  =  1 

4.10.50. 

y  =  1  and 

y  =  - 

-1 

CI  Q  Q 

o.o.o. 

max  at  a;  =  2,  min  at  a;  =  4 

5.1.1. 

min  at  a;  = 

=  1/2 

5.O.4. 

min  at  a;  =  ±1,  max  at  a;  =  0. 

5.1.2. 

min  at  a;  = 

=  -1, 

max  at  a;  =  1 

5.3.5. 

min  at  a;  =  1 

5.1.3. 

max  at  x  - 

=  2,  min  at  x  =  4 

5.3.6. 

none 

5.1.4. 

min  at  a;  = 

=  ±1,  max  at  a;  =  0. 

5.3.7. 

none 

5.1.5. 

min  at  a;  = 

=  1 

5.3.8. 

min  at  x  —  77r/12  +  nTr,  max  at 

5.1.6. 

none 

X  =  —tt/12  +  nTT,  for  integer  n. 

5.1.7. 

none 

5.3.9. 

max  at  a;  =  63/64 
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5.3.10. 
5.3.11. 
5.3.12. 
5.3.13. 
5.3.14. 
5.3.15. 
5.3.16. 
5.3.17. 
5.3.18. 

5.4.1. 

5.4.2. 

5.4.3. 
5.4.4. 


5.4.5. 
5.4.6. 
5.4.7. 

5.4.8. 


5.4.9. 
5.4.10. 


max  at  a;  =  7 

max  at  —5~^^^,  min  at  5~^^^ 
none 

max  at  —1,  min  at  1 
min  at  2"^/^ 
none 

min  at  mr 

max  at  mr,  min  at  7r/2  +  mr 

max  at  7r/2+2n7r,  min  at  37r/2+2n7r 

concave  up  everywhere 

concave  up  when  x  <  0,  concave 
down  when  x  >  0 

concave  down  when  x  <  3,  concave 
up  when  x  >  3 

concave  up  when  x  <  —1/V3  or 
X  >  l/\/3,  concave  down  when 
-1/V3<x  <  1/V3 

concave  up  when  a;<0ora;>2/3, 
concave  down  when  0  <  a;  <  2/3 

concave  up  when  x  <  0,  concave 
down  when  x  >  0 

concave  up  when  x<— lora:>l, 
concave  down  when  —  1  <  a;  <  0  or 
0  <  a;  <  1 

concave  down  on  ((8n  — l)7r/4,  (8n  + 
3)7r/4),  concave  up  on  ((8n  + 
3)7r/4,  {8n  +  7)7r/4),  for  integer  n 

concave  down  everywhere 

concave  up  on  (-00,  (21-\/497)/4) 


and  (21  +  7497)74,  oo) 

5.4.11.  concave  up  on  (0,  oo) 

5.4.12.  concave  down  on  (2n7r/3,  (2n  + 
l)7r/3) 

5.4.13.  concave  up  on  (0,oo) 


5.4.14.  concave  up  on  (— oo,  —1)  and  (0,  oo) 

5.4.15.  concave  down  everywhere 

5.4.16.  concave  up  everywhere 

5.4.17.  concave  up  on  (7r/4  +  n7r,  37r/4  +  n7r) 

5.4.18.  inflection  points  at  mr, 
±  arcsin(^2/3)  +  mr 

5.4.19.  up/incr:  (3,  oo),  up/decr:  (— oo,0), 
(2,3),  down/deer:  (0,2) 

6.1.1.  max  at  (2,  5),  min  at  (0, 1) 

6.1.2.  25  X  25 

6.1.3.  P/4  X  P/4 

6.1.4.  w  =  1  =  2-  52/^  h  =  52/^  h/w  = 

1/2 

6.1.5.  \/lOOx  \^x2\/l00,  h/s  =  2 

6.1.6.  w  =  l  =  21/VV3,  h  =  1/1/3/22/3^ 
h/w=  1/2 

6.1.7.  1250  square  feet 

6.1.8.  f/8  square  feet 

6.1.9.  $5000 
6.1.10.  100 
6.1.11. 

6.1.12.  h/r  =  2 

6.1.13.  h/r  =  2 

6.1.14.  r  =  5,h  =  40/7r,  h/r  =  S/tt 

6.1.15.  S/tt 

6.1.16.  4/27 

6.1.17.  Go  direct  from  A  to  D. 

6.1.18.  (a)  2,  (b)  7/2 

«  1  io    ^     v^S     1      1  V3 

6.1.19.  —  X  h  -  X  

6        6       2     4  12 

6.1.20.  (a)  a/6,    (b)   {a  +  b  - 

-a6  +  62)/6 
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6.1.21.  1.5  meters  wide  by  1.25  meters  tall 

6.1.22.  If  /c  <  2/7r  the  ratio  is  (2  -  /c7r)/4; 
a  k  >  2/7r,  the  ratio  is  zero:  the 
window  should  be  semicircular  with 
no  rectangular  part. 

6.1.23.  a/b 

6.1.24.  w  =  2r/\/3,  h  =  2V2r/VS 

6.1.25.  1/v^^  58% 

6.1.26.  18  X  18  X  36 

6.1.27.  r  =  5/(27r)^/^  ^  2.7  cm, 

h  =  5-  2^/^/7r^/^  =  4r  ^  10.8  cm 

.-,00,         750  /  27r2  \ 

/Tso^y/' 

6.1.29.  h/r  =  V2 

6.1.30.  The  ratio  of  the  volume  of  the 
sphere  to  the  volume  of  the  cone  is 
1033/4096  +  33/4096v^  ^  0.2854, 
so  the  cone  occupies  approximately 
28.54%  of  the  sphere. 

6.1.31.  P  should  be  at  distance  c\^/{\^+ 
v^)  from  charge  A. 

6.1.32.  1/2 

6.1.33.  $7000 

6.1.34.  There  is  a  critical  point  when 
sin^i/vi  =  sin02/f2,  and  the 
second  derivative  is  positive,  so 
there  is  a  minimum  at  the  critical 
point. 

6.2.1.  l/(167r)  cm/s 

6.2.2.  3/(10007r)  meters/second 

6.2.3.  1/4  m/s 

6.2.4.  -6/25  m/s 


6.2.5.  807r  mi/min 

6.2.6.  3A/5ft/s 

6.2.7.  20/(37r)  cm/s 

6.2.8.  13/20  ft/s 

6.2.9.  5^10/2  m/s 

6.2.10.  75/64  m/min 

6.2.11.  1457r/72m/s 

6.2.12.  257r/144  m/min 

6.2.13.  7rv^/36  ft^/s 

6.2.14.  tip:  6  ft/s,  length:  5/2  ft/s 

6.2.15.  tip:  20/11  m/s,  length:  9/11  m/s 

6.2.16.  380/V3  -  150  ^  69.4  mph 

6.2.17.  500/^3  -  200  ^  88.7  km/hr 

6.2.18.  18  m/s 

6.2.19.  136^475/19  ^  156  km/hr 

6.2.20.  -50  m/s 

6.2.21.  68  m/s 

6.2.22.  3800/v^  ~  210  km/hr 

6.2.23.  820/V329  +  150^57/^47  ^  210  km/hr 

6.2.24.  4000/49  m/s 

6.2.25.  (a)  X  =  a  cos  6*  —  a  sin  6*  cot(6'  +  (3)  — 
asin/3/ sin(6'  +       (c)  i;  ~  3.79  cm/s 

6.3.1.  X3  =  1.475773162 

6.3.2.  2.15 

6.3.3.  3.36 

6.3.4.  2.19  or  1.26 

6.4.1.  Ay  =  65/16,  dy  =  2 

6.4.2.  Ay  =  v^ll/lO  -  1,      =  0.05 

6.4.3.  Ay  =  sin(7r/50),  dy  =  7r/50 

6.4.4.  dV  =  87r/25 
6.5.1.  c=  1/2 


Appendix  A  Selected  Answers  487 


6.5.2.  c  =  Vl8-2 

6.5.6.  +  47a;^/2  -  5x  +  k 

6.5.7.  arctanx  +  fc 

6.5.8.  x^/A-lnx  +  k 

6.5.9.  -cos(2a;)/2  +  A; 

7.1.1.  10 

7.1.2.  35/3 

7.1.3.  x'^ 

7.1.4.  2x'^ 

7.1.5.  2x^-8 

7.1.6.  26^  -  2o^ 

7.1.7.  4  rectangles:  41/4  =  10.25, 

8  rectangles:  183/16  =  11.4375 

7.1.8.  23/4 

7.2.1.  {16/3)x^/^  +  C 

7.2.2.  t^  +  t  +  C 

7.2.3.  8y/x  +  C 

7.2.4.  -2/z  +  C 

7.2.5.  71ns +  C 

7.2.6.  (5x  + 1)^/15 +  C 

7.2.7.  {x-6f/3  +  C 

7.2.8.  2x^/2/5 +  C 

7.2.9.  -4/v^  +  C 

7.2.10.  4t  -     +  C,  t  <  2;     -  4t  +  8  +  C, 

t  >  2 

7.2.11.  87/2 

7.2.12.  2 

7.2.13.  In(lO) 

7.2.14.  -  1 

7.2.15.  3^/4 

7.2.16.  2^6  -  1/6 

7.2.17.  -  3x 


7.2.18.  2a;(a;^  -  3a;^) 

7.2.19.  e^' 

7.2.20.  2x6""^ 

7.2.21.  tan(x^) 

7.2.22.  2a;tan(a;^) 

7.3.1.  It  rises  until  t  =  100/49,  then  falls. 
The  position  of  the  object  at  time 
t  is  s{t)  =  -4.9t^  +  20t  +  k.  The 
net  distance  traveled  is  —45/2,  that 
is,  it  ends  up  45/2  meters  below 
where  it  started.  The  total  distance 
traveled  is  6205/98  meters. 

/■27r 

7.3.2.  /     siatdt  =  0 
Jo 

7.3.3.  net:  27r,  total:  27r/3  +  4V3 

7.3.4.  8 

7.3.5.  17/3 

7.3.6.  A=18,B  =  44/3,  C  =  10/3 

8.1.1.  -{1-ty^/lO  +  C 

8.1.2.  x^/5  +  2x^/3  +  x  +  C 

8.1.3.  (a;2  +  l)^°V202  +  C 

8.1.4.  -3(1  -5t)2/3/10  +  C 

8.1.5.  (sin^x)/4  +  C 

8.1.6.  -{100-x'^)^/^/3  +  C 

8.1.7.  -2^1  -a;3/3  +  C 

8.1.8.  sin(sin7rt)/7r  +  C 

8.1.9.  l/(2cos^a;)  =  (l/2)sec2a;  +  C 

8.1.10.  -ln|cosa;|  +  C 

8.1.11.  0 

8.1.12.  tan2(x)/2  +  C 

8.1.13.  1/4 

8.1.14.  —  cos(tana;)  +  C 

8.1.15.  1/10 
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8.1.16. 

8.1.17.  (27/8)  (x^  -  7)^/^ 

8.1.18.  -(3^  +  l)/14 

8.1.19.  0 

8.1.20.  f{xf/2 

8.2.1.  x/2-sm{2x)/A  +  C 

8.2.2.  -cosa;+ (cos^a;)/3  +  C 

8.2.3.  3x/8-  (sin2a;)/4+ (sin4a;)/32  +  C 

8.2.4.  (cos^a;)/5  -  (cos^x)/3  +  C 

8.2.5.  sinx  -  (sm^x)/3  +  C 

8.2.6.  x/S-  (sm4x)/32  +  C 

8.2.7.  {sin^x)/3- {sm^x)/5  +  C 

8.2.8.  -2{cosx)^/^/5  +  C 

8.2.9.  tan  a;  —  cot  a;  +  C 
8.2.10.  (sec^a;)/3  -  seca;  + C 

8.3.1.  —  In  I  csca;  +  cotxl  +  C 

8.3.2.  —  cscx  cot  a;/2  —  (1/2)  In  I  cscx  + 
cot  x\  +  C 

8.3.3.  xVx^  -  1/2-ln  \x+Vx^  -  l\/2+C 

8.3.4.  a;V9  +  4a;V2  + 

(9/4)  In \2x  +  A/9  +  4a;2|  +  C 

8.3.5.  -{1  -  x^f/^ /3  +  C 

8.3.6.  arcsin(a;)/8  —  sin(4arcsina;)/32  +  C 

8.3.7.  ln\x+  y^l  +  x^\+C 

8.3.8.  {x  +  l)^x^  +  2x/2  - 

In  |a;  +  1  +  Vx^  +  2x\/2  +  C 

8.3.9.  —  arctanx  —  1/a;  +  C 

8.3.10.  2arcsin(x/2)  -  x\^4  -  x'^ /2  +  C 

8.3.11.  arcsin(Va^)  —  \fx\J\  —  x  +  C 

8.3.12.  (2a;V  l)V4a;2  -  1/24  +  C 
8.4.1.  cos  a;  +  a;  sin  a;  +  C 


8.4.2.  a;^  sin  a;  —  2  sin  a;  +  2a;  cos  a;  +  C 

8.4.3.  (a;-l)e^  +  C 

8.4.4.  (l/2)e^'  +C 

8.4.5.  (a;/2)  -  sin(2a:)/4  +  C  = 
(a:/2)  —  (sina:cosa;)/2  +  C 

8.4.6.  slna;  — a;  +  C 

8.4.7.  (x^  arctanx  +  arctanx  —  a;)/2  +  C 

8.4.8.  — x^  cos  X  +  3x^  sin  X  +  6x  cos  X — 
6  sin  X  +  C 

8.4.9.  x^sinx  +  3x^cosx  —  Gxsinx  — 
6  cos  X  +  C 

8.4.10.  x^/4-  (cos^  x) /4-  (x  sin  x  cos  x) /2+ 
C 

8.4.11.  x/4  -  (x  cos^  x) /2  +  (cos  x  sin  x) /4  + 
C 

8.4.12.  xarctan(y^)  +  arctan(y^)  — y^+C 

8.4.13.  2sin(Vi)  -  2Vicos(v/^)  +  C 

8.4.14.  secxcscx  —  2cotx  +  C 

8.5.1.  -ln|x-2|/4  +  ln|x  +  2|/4  +  C 

8.5.2.  -xV3-4x-41n|x-2|  + 
41n|x  +  2|  +C 

8.5.3.  -l/(x  +  5)+C 

8.5.4.  -x-ln|x-2| +ln|x  +  2| +C 

8.5.5.  -4x  +  x^/3  +  8arctan(x/2)  +  C 

8.5.6.  (l/2)arctan(x/2  +  5/2) +  C 

8.5.7.  xV2-21n(4  +  x2)  +  C 

8.5.8.  (1/4)  In  |x  +  3|  -  (1/4)  In  |x  +  7|  +  C 

8.5.9.  (1/5)  In  |2x  -  3|  -  (1/5)  In  |1  +  x|  +  C 
8.5.10.  (1/3)  In  |x|  -  (1/3)  In  |x  +  3|  +  C 

8.6.1.  T,S:  4±0 

8.6.2.  T:  9.28125  ±  0.281125;  S:  9  ±  0 

8.6.3.  T:  60.75  ±  1;  S:  60  ±  0 

8.6.4.  T:  1.1167±0.0833;  S:  1.1000±0.0167 


8.6.5.  T:  0.3235  ±  0.0026;  S:  0.3217  ± 
0.000065 

8.6.6.  T:  0.6478  ±  0.0052;  S:  0.6438  ± 
0.000033 


8.6.7. 
8.6.8. 
8.6.9. 
8.6.10. 

8.7.1. 
8.7.2. 

8.7.3. 
8.7.4. 

8.7.5. 
8.7.6. 
8.7.7. 


T:  2.8833±0.0834;  S:  2.9000±0.0167 

T:  1.1170±0.0077;  S:  1.1114±0.0002 
T:  1.097  ±0.0147;  S:  1.089  ±0.0003 
T:  3.63  ±  0.087;  S:  3.62  ±  0.032 


(f  -  9)5/2 


(e*'  +  16)^ 


+  C 


+  C 


COS  ^  ~  2  cos  t  +  C 


tan^  t 


+  C 


2 

Inlt"^  +  t  +  3\  +  C 
1 


8 


ln|l  -4/t^l  +  C 


1 


8.7.8.  —  tan(arcsin(t/5))  +  C 


25 


t 


8.7.9. 

8.7.10. 
8.7.11. 

8.7.12. 
8.7.13. 
8.7.14. 
8.7.15. 


25^25-^2 
2 


+  C 


Vsin3t  +  C 
3 

t  tan  t  +  In  I  cos  t\  +  C 

2Ve*  ±  1  ±  C 
3t     sin2t     sin4t  ^ 
8+  —  +  ^+^ 
ln|f|      ln|f  +  3|  ^  ^ 


sin  arctan  t 
-1 


2(1  +  tant)2 


+  c  =  -Vi  +  tyt  +  c 
+  c 


8.7.16. 
8.7.17. 

8.7.18. 
8.7.19. 

8.7.20. 

8.7.21. 

8.7.22. 

8.7.23. 

8.7.24. 

8.7.25. 

8.7.26. 
8.7.27. 

8.7.28. 


9.1.1. 
9.1.2. 
9.1.3. 
9.1.4. 
9.1.5. 
9.1.6. 
9.1.7. 
9.1.8. 
9.1.9. 
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(^2  +  1)5/2       (^2  ^  1)3/2 


5  3 

e*  sin  t  —     cos  t  ^ 

 T.   +  C' 


(^3/2  +  47)^ 

6 
2 


3(2-^2)3/2  (2-^2)1/2 

In  I  sin(arctan(2t/3))  I 

9  +^  = 

(ln(4t2)  -  ln(9  +  4t2))/18  +  C 

(arctan(2t))2  ^  ^ 
4 

31n|t  +  3|  ^  ln|t-l|  ^  ^ 

4  4 

cos^  t     cos^  t  ^ 
-         +  C 


7 

-1 


t-3 
-1 
Int 

t2(lnt)2 


+  C 


t^lnt  t2 


ln(2t  +  1  -  75)  + 
ln(2t  +  1  +  V5)  +  C 


(t^  -  3*2  +  6t  -  6)e*  +  C 

5±  v^S. 

10 
5- 

10 
8^2/15 
1/12 
9/2 
4/3 

2/3-2/7r 

3/7r  -  3^3/ (27r)  -  1/8 
1/3 

10^5/3  -  6 
500/3 
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9.1.10.  2 

9.1.11.  1/5 

9.1.12.  1/6 

9.2.1.  I/tt,  S/tt 

9.2.2.  0,  245 

9.2.3.  20,  28 

9.2.4.  (3-7r)/(27r),  (18  -  12^3  +  7r)/(47r) 

9.2.5.  10/49  meters,  20/49  seconds 

9.2.6.  45/98  meters,  30/49  seconds 

9.2.7.  25000/49  meters,  1000/49  seconds 

9.2.8.  s{t)  —  cost,  v{t)  =  —  sint, 
maximum  distance  is  1, 
maximum  speed  is  1 

9.2.9.  s{t)  =  -  sm{nt) /tt^  +  t/n, 
v{t)  —  —  cos(7rt)/7r  +  I/tt, 
maximum  speed  is  2/7r 

9.2.10.  s{t)  =        -  sin(7rt)/7r2  +  t/yr, 
v{t)  —t  —  cos{ivt)/Tv  +  I/tt 

9.2.11.  s{t)  =  f/2  +  sin(7rt)/7r2  -  t/yr, 
v{t)  =t  +  cos(7rt)/7r  —  I/tt 

9.3.5.  87r/3 

9.3.6.  7r/30 

9.3.7.  7r(7r/2-l) 

9.3.8.  (a)  1147r/5  (b)  747r/5  (c)  207r 
(d)  47r 

9.3.9.  IGtt,  247r 
9.3.11.  TTh?{Zr-h)/Z 

9.3.13.  2tt 

9.4.1.  2/7r;  2/7r;  0 

9.4.2.  4/3 

9.4.3.  l/A 

9.4.4.  7r/4 

9.4.5.  -1/3,  1 


9.4.6.  -4V1224  ft/s;  -8V1224  ft/s 

9.5.1.  5,  305,  028,  516  N-m 

9.5.2.  4,  457,  854,  041  N-m 

9.5.3.  367,5007r  N-m 

9.5.4.  490007r  +  196000/3  N-m 

9.5.5.  24507r  N-m 

9.5.6.  0.05  N-m 

9.5.7.  6/5  N-m 

9.5.8.  3920  N-m 

9.5.9.  23520  N-m 
9.5.10.  12740  N-m 

9.6.1.  15/2 

9.6.2.  5 

9.6.3.  16/5 

9.6.5.  X  =  45/28,  y  =  93/70 

9.6.6.  X  =  0,  y  =  4/{37t) 

9.6.7.  X  =  1/2,  y  =  2/5 

9.6.8.  ^  =  0,  y  =  8/5 

9.6.9.  X  =  4/7,  y  =  2/5 

9.6.10.  x  =  y  =  1/5 

9.6.11.  X  =  0,  y  =  28/(97r) 

9.6.12.  x  =  y  =  28/(97r) 

9.6.13.  x  =  0,y  =  244/(277r)  w  2.88 

9.7.1.  oo 

9.7.2.  1/2 

9.7.3.  diverges 

9.7.4.  diverges 

9.7.5.  1 

9.7.6.  diverges 

9.7.7.  2 

9.7.8.  diverges 

9.7.9.  7r/6 
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9.7.10. 
9.7.11. 
9.7.12. 
9.7.13. 
9.7.14. 


9.8.2. 
9.8.3. 
9.8.4. 
9.8.5. 
9.8.9. 
9.9.1. 
9.9.2. 
9.9.3. 
9.9.4. 
9.9.6. 
9.9.7. 
9.9.8. 
9»  9  •  9  • 


diverges,  0 
diverges,  0 
diverges,  no  CPV 

80  mph:  90.8  to  95.3  N 
90  mph:  114.9  to  120.6  N 
100.9  mph:  144.5  to  151.6  N 

H  =  1/c,  a  =  1/c 

fi  =  {a  +  b)/2,  a  =  {a-hfjVl 

7/2 

21/2 

r  =  6 


(22V22-8)/27 
ln(2)  +  3/8 
a  +  a'Y3 
ln((\/2  +  l)/\/3) 
3/4 
fa  3.82 
f»  1.01 

a/TT?  -  V2  + 


9.10.1.  87rv^- 
9.10.3. 


167r\/2 


7307rv^  IOttv^ 


27 


27 


9.10.4. 
9.10.6. 
9.10.7. 

9.10.8. 


1 


TT  +  27re  +  -Tre  — 


TT 

4^ 


27r 

e 


87r2 


27r  + 


87r^ 


3^/3 

a  >  6:  27r6^+ 
27ra26 


arcsm  (Va^  -67a), 


a  <  6:  27r6^+ 

=  In  — I  

^/h"^  -  a?      \a  a 

10.1.2.  ^  =  arctan(3) 

10.1.3.  r  =  -4csc6' 

10.1.4.  r  =  sec^csc^  9 

10.1.5.  r  = 

10.1.6.  =  sin6'sec^6' 

10.1.7.  rsin^  =  sin(rcos^) 

10.1.8.  r  =  2/(sin^-5cos^) 

10.1.9.  r  =  2sec6' 

10.1.10.  0  =  r^cos^6'-rsin6'  +  l 

10.1.11.  0  =  3r^cos^6' -  2rcos6' -rsin6' 

10.1.12.  r  =  sin^ 

10.1.21.  {x^  +  y'^f  =  Ax'y  -  {x"  +  y^)y 

10.1.22.  (a;2  + 2/2)3/2  ^^2 

10.1.23.  a:2  +  y2  =  a;V 

10.1.24.  x^  +  x^y"^  =2/2 

10.2.1.  (^cos^  +  sin^)/(-^sin^  +  cos 6*), 

+  2)/(-^sin^  +  cos6')^ 

cos  6*  +  2  sin  6'  cos  9 

10.2.2.  2  , 

cos^  ^  —  sin  9  —  sin  9 

3(1  + sin  ^) 
(cos^  9  —  SIT?  9  —  sin  9)^ 

10.2.3.  (sin^^  -  cos2^)/(2sin^cos^), 
-l/(4sin3^cos3^) 

2sin^cos^  2 

10.2.4. 
10.2.5. 
10.2.6. 


cos^  9  —  s\v?  9 '  (cos^  9  —  s\v?  9)^ 
undefined 
2sin^  -  3sin^6' 
3cos3^-2cos^' 
3cos^g-3cos^g  +  2 
2cos3^(3cos2^-2)3 


10.3.1.  1 
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10.3.2.  97r/2 

10.3.3.  v^/3 

10.3.4.  7r/12  + 73/16 

10.3.5.  7raV4 

10.3.6.  4l7r/2 

10.3.7.  2-7r/2 

10.3.8.  7r/12 

10.3.9.  37r/16 

10.3.10.  7r/4-3A/3/8 

10.3.11.  7r/2  + 3^3/8 

10.3.12.  1 

10.3.13.  3/2-7r/4 

10.3.14.  7r/3  +  v^/2 

10.3.15.  77/3-^3/4 

10.3.16.  Att^/Z 

10.3.17.  TT^ 

10.3.18.  57r/24  -  ^3/4 

10.3.19.  77r/12-y3 

10.3.20.  47r-\/l5/2-7arccos(l/4) 

10.3.21.  37r^ 

sin(t)  ^  _  ^  cos(t) 


10.4.6.  x  =  t 


t=l 


2    '  2 

10.4.7.  a;  =  4cost  -  cos(4t), 
y  =  4  sin  t  —  sin(4t) 

10.4.8.  x  =  2cost  +  cos(2t), 
y  =  2  sint  —  sin(2t) 

10.4.9.  X  =  cost  +  t sin 
y  =  sin  t  —  t  cos  t 

10.5.1.  There  is  a  horizontal  tangent  at  aU 
multiples  of  tt. 

10.5.2.  97r/4 


10.5.3. 


10.5.4.  Four  points: 
-3-3^5 


±1 


'5-x/5 


-3  +  3^5    ,    /5  +  V5 

^— '±V^ 

10.5.5.  ll7r/3 

10.5.6.  32/3 

10.5.7.  27r 

10.5.8.  16/3 

10.5.9.  (7r/2,l) 

10.5.10.  57r^/6 

10.5.11.  27r^ 

10.5.12.  (27rA/47r2  +  1  +  ln(27r  + 
V47r2  +  l))/2 

11.1.1.  1 

11.1.3.  0 

11.1.4.  1 

11.1.5.  1 

11.1.6.  0 

11.2.1.  lim  nV(2n^  +  1)  =  1/2 

n— >oo 

11.2.2.  lim  5/(2^/"  +  14)  =  1/3 

n— >-oo 

CxD 

11.2.3.  Er=i  ^  diverges,  so  ^  3 


4  COS  t  dt 


11.2.4. 
11.2.5. 
11.2.6. 
11.2.7. 
11.2.8. 
11.2.9. 
11.3.1. 


verges 

-3/2 

11 

20 

3/4 

3/2 

3/10 

diverges 
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11.3.2.  diverges 

11.3.3.  converges 

11.3.4.  converges 

11.3.5.  converges 

11.3.6.  converges 

11.3.7.  diverges 

11.3.8.  converges 

11.3.9.  N  =  5 

11.3.10.  N  =10 

11.3.11.  N  =  1687 

11.3.12.  any  integer  greater  than  e 

11.4.1.  converges 

11.4.2.  converges 

11.4.3.  diverges 

11.4.4.  converges 

11.4.5.  0.90 

11.4.6.  0.95 

11.5.1.  converges 

11.5.2.  converges 

11.5.3.  converges 

11.5.4.  diverges 

11.5.5.  diverges 

11.5.6.  diverges 

11.5.7.  converges 

11.5.8.  diverges 

11.5.9.  converges 
11.5.10.  diverges 

11.6.1.  converges  absolutely 

11.6.2.  diverges 

11.6.3.  converges  conditionally 

11.6.4.  converges  absolutely 

11.6.5.  converges  conditionally 


200 


11.6.6. 
11.6.7. 
11.6.8. 
11.7.5. 
11.7.6. 
11.7.7. 
11.7.8. 
11.8.1. 
11.8.2. 
11.8.3. 
11.8.4. 
11.8.5. 
11.8.6. 
11.9.1. 

11.9.2. 

11.9.3. 

11.9.4. 

11.9.5. 
11.10.1. 
11.10.2. 
11.10.3. 


converges  absolutely 
diverges 

converges  conditionally 

converges 

converges 

converges 

diverges 

R=  1,1  =  (-1,1) 

i?  =  oo,  /  =  (— oo,  oo) 

R  =  e,  I  =  (— e,  e) 

R  =  e,  I  ={2-e,2  +  e) 

R  =  0,  converges  only  when  x  = 

i?  =  1,  /  =  [-6,  -4] 

the  alternating  harmonic  series 

oo 

J](n  +  l)x" 

n=0 

oo 

^(n  +  l)(n  +  2)a;^ 

n=0 

oo 

E 

n=0 


OO  ^ 

(n  +  l){n  +  2) 

n=0  ^  ^  ^  ^ 


X 


n+2 


^(-l)^a;2"/(2n)!,  i?  =  oo 

n=0 
OO 

J]a;"/n!,  R  =  oo 


n=0 

oo 


n=0 

oo 


11.10.4.  y (-1)^-1  i^-^,  i?  =  1 

n 
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11.10.5.  ln(2)  +  f:(-l)-^     ~5  ^  ^  -  2  ^^"^^^S^^ 


n=i  11.12.17.  diverges 


11.10.6.  Y.{-inn  +l){x-  ir,  i?  =  1      11-12.18.  (-00,  oo) 
n=o  11.12.19.  (-3,3) 


11.10.7.  1  +  ±  L±±_iH!i^,„  ^      11.12.20.  (-3.3) 

n=l  11.12.21.  (-1,1) 

oo 


I  _|_  ^2"'     l-^'  i?=i    11.12.22.  radius  is  0 — it  converges  only  when 

^  22^-i(n- l)!n!  '  n 
n=i  ^  a;  =  U 

11.10.8.  x  +  xys  11.12.23.  (-a/3,  a/3) 

11.10.9.  X](-1)"^'"+V(2n)!  11.12.24.  (-00,00) 

00 

11.12.25. 


11.10.10.  ^(-1)V+V^! 


V-  (ln(2))^  „ 

n=0 

n=0  °°    (  —  ^]'^ 

2         4  6  12  11.12.26.   '-X''+^ 

X       X        X  X  jL^  ri  +  1 

11.11.1.  1  +  +  ■■■  +  — r  n=o^^-^ 

2      24     720  12! 

11.11.2.  1000;  8  ,,_,2.27. 


 ^2n+l 

„3       0.5  fo^^+^ 


X  2x 

11.11.3.  x-\  \  ,  error  ±1.27. 

3       15  11.12.28.  1   +  a;/2  + 


11.12.1.  diverges 


11.12.2.  converges  Z— ^  2"n' 

n=2 


CO 


11.12.3.  converges 

11.12.4.  diverges  11.12.29.  '^{-1) 


n^2n 


n=0 

00 


11.12.5.  diverges 

11.12.6.  diverges  11.12.30.  ^ 

11.12.7.  converges 


(     1)"  j.2n+l 


2n+l 

n=0 


11.12.8.  converges  11.12.31.  7r=  >  (-1)^  

4^         2n  +  l 

11.12.9.  converges 


11.12.10.  converges  12.1.6.  3,  V26,  V29 

11.12.11.  converges  12.1.7.  \/l4,  2\/l4,  3\/l4. 

11.12.12.  converges  12.1.8.  (x  -  1)^  +  (y  -  1)^  +     -  1)^  =  4. 

11.12.13.  converges  12.1.9.  {x  -  2f  +  (y  +  if  +  {z  -  3)^  =  25. 

11.12.14.  converges  12.1.11.  {x  -  2f  +  (y  -  1)^  +  (^  +  1)^  =  16, 

11.12.15.  converges  {y  -  if  +  (^  +  1)^  =  12 


Appendix  A  Selected  Answers  495 


1  n  n 

12.2.6. 

VlO,  (0,-2),  (2,8)  2,  2vl7, 

12.3.17. 

(1/^3, -1/v^,  1/V3) 

(-2,-6) 

12.3.18. 

No. 

12.2.7. 

Via,  (0,4,0),  (2,0,6)  4,  2v^, 

12.3.19. 

Yes. 

(-2,-4,-6) 

(1,-2,1) 

12.4.1. 

12.2.8. 

A  (0,-2,3),  (2,2,-1)  %/l3,  3, 

12.4.2. 

(-2,0,-2) 

(4,  -6,  -2) 

12.2.9. 

^,  (1,-1,4),  (1,-1,-2)  3%/2,  %/6, 

12  4^ 

/— 7  13  — Q\ 
\    ' ,  -'-'J, 

(-2,2,-2) 

12.4.4. 

(0,-1,0) 

12.2.10. 

(2,1,0),  (4,3,2)  ^,  V29, 

12.4.5. 

3 

(-6,-4,-2) 

12.4.6. 

21^2/2 

12.2.11. 

(-3,-3,-11), 

12.4.7. 

1 

(-3/V139,  -3/V139,  -11/V139) 

12.5.1. 

{x-6)  +  {y-2)  +  (z 

(-I2/V139,  -12/\/139,  -44/V139) 

12.2.12. 

(0,0,0) 

12.5.2. 

A(x  +  1)  +  5{y  -  2)  - 

12.2.13. 

0;  (-rV3/2,r/2);  (0, -12r);  where 

12.5.3. 

(a;  -  1)  -  (y  -  2)  =  0 

TO    T"  ri  £i  T" o  *H  n  1  o   /~vT  T"  ri /li       1  ^ \  f'  XT' 

1   lo  Llie  IdCllUh  Ui  Llie  ClUCK 

12.5.4. 

-2(a;  -  1)  +3y  -2^  = 

Q 

o 

12.5.5. 

4(a;  _  1)  _  6y  =  0 

1  "2  ^  "2 

n 

u 

1  2  6 

12.3.3. 

2 

1  2  7 

/I  n  3\  -1-  ^/O  9  1  \ 
\i,  u,  o/  -t-  t\U,  Z,  1/ 

12.3.4. 

-6 

1  O  K  Q 

(1,0,3)  +t(i,^,-i; 

12.3.5. 

42 

T  O  K  Ci 

iz.o.y. 

-/-/II    1  \ 
*(i,  1,  -1) 

12.3.6. 

^/6/\/7,  «  0.39 

12.5.1U. 

-2/5,  13/5 

12.3.7. 

-ll^\/29/406,  «  2.15 

12.5.12. 

neither 

12.3.8. 

0,  7r/2 

Iz.o.lo. 

parallel 

12.3.9. 

1/2,  7r/3 

intersect  at  (3, 6, 5) 

lz.o.14. 

12.3.10. 

-1/v^,  2.19 

1  O  1 
X^.Q.XD. 

same  line 

Iz.o.ll. 

arccos(i/voj  ~  u.yo 

12.5.19. 

7/^3 

Vo,  \i, u;. 

12.5.20. 

4/Vl4 

U.o.lo. 

3V14/7,  (9/7,6/7,3/7). 

12.5.21. 

Vl31/v^ 

12.3.14. 

(0,5),  (5v^,0) 

12.3.15. 

(0,15^2/2), (15^2/2,0) 

12.5.22. 

a/68/3 

12.3.16. 

Any  vector  of  the  form 

12.5.23. 

V42/7 

(a,-7a/2,-2a) 

12.5.24. 

V2I/6 
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12.6.1.  (V2,7r/4,1), 

(V3, 7r/4,arccos(l/V3)); 
(7A77r/4,5), 

(yi23,77r/4,  arccos  (5/v^); 
(1,1,1),  (Al,7r/4);  (0,0, -tt), 
(7r,0,7r) 

12.6.2.  r'^  +  z'^  =  4: 

12.6.3.  rcos6'  =  0 

12.6.4.  +  2^2 +  2^-5  =  0 

12.6.5.  z  =  e"""' 

12.6.6.  z  =  r 

12.6.7.  sin^  =  0 

12.6.8.  1  =  pcos^ 

12.6.9.  p  =  2  sin  6*  sin  0. 

12.6.10.  psin(/)  =  2 

12.6.11.  COS0  =  l/\/2 

12.6.13.  z  =  mr;  cotcf)  —  miim^O,  (f)  —  0 
if  m  =  0 

12.6.14.  A  sphere  with  radius  1/2,  center  at 
(0,1/2,0) 

12.6.15.  0  <  ^  <  7r/2,  0  <  (j)<  7r/2,  p  >  0; 
0  <  ^  <  7r/2,  r  >  0,  2  >  0 

13.1.5.  (3cost3sint,2 -3sint) 

13.1.6.  {O,tcost,tsmt) 

13.2.1.  (2t,0,l),  r7Vl  +  4t2 

13.2.2.  (-sint,2cos2t,2t), 

r  V    sin^  i  +  4  cos2  ( 2t )  +  4t2 

13.2.3.  (— e*  sin(e*),    cos(e*),  cost), 
r'/ Ve^*  +  cos^  t 

13.2.4.  (v^/2,  v^/2,7r/4)  + 
t(- V2/2,  V2/2, 1) 

13.2.5.  (1/2,  \/3/2, -1/2)  + 
t(-^/2, 1/2,  2V^) 


13.2.6.  2/v^/V4  +  7r2 

13.2.7.  7a/5a/I7/85,  -9a/5a/17/85 

13.2.9.  (O,tcost,tsint),   (O,cost  - 

tsint,sint  +  tcost),  r'/\/l  +  t^, 

13.2.10.  (sint,  1 -cost,t2/2) 

13.2.11.  t  =  4 

13.2.12.  37,  1 

13.2.13.  {t^/2,t^/3,siat) 

13.2.16.  (1, 1, 1)  when  t  =  1  and  s  =  0; 
9  =  arccos ( 3/ vT3);  no 

13.2.17.  -6x  +  {y-TT)  =  0 

13.2.18.  -x/V2  +  y/V2  +  6z  =  0 

13.2.19.  (-1,-3,1) 

13.2.20.  (1/a/2,1/v^,0) +t(-l,  1,6^2) 

13.3.1.  2tvV13 

13.3.2.  (-8  +  13v/T3)/27 

13.3.3.  V5/2  +  ln(V5  +  2)/4 

13.3.4.  (85\/85- 13v^)/27 

13.3.5.  J^VTTe^dt 

13.3.6.  2v^/(2  + 4^2)3/2 

13.3.7.  2^2/(1  +  8t2)3/2 

13.3.8.  2V1  + +  9^4/(1  +  4^2  ^  9^4^3/2 

13.3.9.  12v^/289 

13.4.1.  (— sin t,  cost,  1),  (— cost,  —  sin t,  0), 
0,  1 

13.4.2.  (—  sint,  cost,  2t),  (—  cost,  —  sint,  2), 
4t/V4t2  +  1,  V4t2  +5/V4t2  +  1 

13.4.3.  (— sint, cost, e*), 

(— cost,  —  sint,  e*),  e^YVe^*  +  1, 
V2e2*  +  1/Ve2t  +  1 
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13.4.4. 

13.4.5. 
13.4.6. 

13.4.7. 
13.4.8. 

14.1.1. 

14.1.2. 
14.1.3. 

14.1.4. 

14.1.5. 
14.1.6. 

14.2.1. 

14.2.2. 

14.2.3. 

14.2.4. 

14.2.5. 

14.2.6. 

14.2.7. 

14.2.8. 

14.2.9. 
14.2.10. 
14.2.11. 
14.2.12. 

14.3.1. 


(e*,  cost,  e*),  (e*,  —  sint,  e*), 
(2e2*  -  cos  t  sin  t) / VSe^*  +  cos^  t, 
v^e* I  cos  t  +  sint|/V2e2*  +  cos2t 

(—3  sin  t,  2  cos  t,  0) ,  (3  cos  t,  2  sin  t,  0) 

(-3 sin t,  2 cost  +  0.1,0), 
(3cost,2sint  +  t/10,0) 

(—3  sin  t,  2  cos  t,  1) ,  (3  cos  t,  2  sin  t,  t) 

(-3sint,2cost  +  1/10,1), 
(3cost,2sint +  t/10,t) 

z  —  y'^,  z  =  x"^,  z  —  0,  lines  of  slope 
1 

z  =  \y\,  z  =  \x\,  z  =  2\x\,  diamonds 


z 
z 


sin(y^ 


e  ^  sin(a;^). 


-2X'' 


sin(2a;^),  circles 


0, 


z  =  —  sin(y),  z  =  sin(a;),  z 
lines  of  slope  1 

z  =  y'^,  z  =  x^,  z  =  0,  hyperbolas 

(a)  I  |x|  <3and  |y|  >  2} 

(b)  {{x,y)\l<x^  +  y^<3} 

(c)  {ix,y)\x''+4y^  <16} 

No  limit;  use  a;  =  0  and  y  =  0. 
No  limit;  use  x  =  0  and  x  =  y. 
No  limit;  use  x  —  0  and  x  =  y. 
Limit  is  zero. 
Limit  is  1. 
Limit  is  zero. 
Limit  is  —1. 
Limit  is  zero. 

No  limit;  use  x  =  0  and  y  =  0. 
Limit  is  zero. 
Limit  is  —1. 
Limit  is  zero. 

—2xysm{x^y),  —x'^  sm{x^y)  +  3y 


14.3.2. 
14.3.3. 
14.3.4. 
14.3.5. 

14.3.6. 

14.3.7. 

14.3.8. 

14.3.9. 
14.3.10. 
14.3.11. 
14.3.12. 

14.4.1. 

14.4.2. 

14.4.3. 

14.4.4. 
14.4.5. 
14.4.6. 
14.4.7. 


14.5.1. 
14.5.2. 
14.5.3. 
14.5.4. 
14.5.5. 
14.5.6. 
14.5.7. 
14.5.8. 
14.5.9. 


(y^  -  x'^y)/ix^  +  yf,  x^/ix"^  +  yf 

2,2  2,2 

y  ln{xy)  +     x  ln{xy)  +  x 


-x/y/1  -x^  -y"^, 


tan  y,  x  sec^  y 

-l/{x'y),  -l/{xy') 

z=  -2{x  -  1)  -  S{y  -  1)  -  1 

z=  1 

2  =  6(x-3)+3(y-l)  +  10 

z  =  (x-2)+4(y-l/2) 

r(t)  =  (2,1,4) +t(2, 4,-1) 

Axt  cos{x'^  +y^)  +  ^yt^  cos{x'^  +  y^) 

2xy  cos  t  +  2xH 

2xytcos{st)  +  2x^5,  2xyscos{st)  + 
2xH 

2xy'^t  —  4yx^s,  2xy^s  +  Ayx^t 

x/z,  Sy/{2z) 
-2x1  z,  -y/z 

(a)  V'  =  {nR-Q.2V)/P 

(b)  P'  =  {nR  +  0.6P)/2V 

(c)  T'  =  {3P  -OAV)/{nR) 

9^5/5 
V2cos3 
6^2(^3-  l)/4 
^3  +  5 

-V6{2  +  V3)/72 
-1/5,  0 

4(a;  -  2)  +  8(y  -  1)  =  0 

2{x  -  3)  +  3(y  -  2)  =  0 

(—1,-1  —  cos  1,  — cos  1), 
-V2  +  2cosl  +  2cos2l 
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14.5.10. 


14.5.11. 
14.5.12. 
14.5.13. 
14.5.14. 
14.5.15. 
14.5.16. 
14.5.18. 
14.5.19. 
14.5.20. 
14.6.1. 

14.6.2. 


14.6.3. 


Any  direction  perpendicular  to 
VT  =  (1,1,1),  for  example, 
(-1,1,0) 

2{x  -  1)  -  6(y  -  1)  +  6(^  -  3)  =  0 

6(a;  -  1)  +  2>{y  -  2)  +  2(z  -  3)  =  0 

(2  +  4t,-3-  12t,-l  -8t) 

(4  +  8t,  2  +  At,  -2  -  m) 

(4  +  8t,  2  4-  20t,  6  -  I2t) 

(0,1),  (4/5,  -3/5) 

(a)  (4,9)  (b)  (-81,2)  or  (81,-2) 

in  the  direction  of  (8, 1) 

V^7(-l,3)  =  (2,l) 

=  (2a:3y  -  6xy3)/(x2  +  y^f, 
fyy  =  {2xy^-6x^y)/{x^  +  y^)^ 

U  =  3x2^2,  /,  =  2x^y  +  hy\ 

fxx 

f^y  =  6x'^y 


6xy^  fyy  =  2x^  +  2Qy\ 


=  12x2  +  ^2^  fy  =  2xy, 


fxx  —  24a;,  fyy  —  2x,  fxy  —  2y 

14.6.4.  fx  ^  siny,  fy  =  a: cosy,  f^x  =  0, 
fyy  =  -xsiny,  f^y  =  cosy 

14.6.5.  fx  =  3cos(3a;)  cos(2y), 
fy  =  -2sin(3x)  sin(2y), 
fxy  =  -6cos(3x)  sin(2y), 
fyy  =  -4sin(3a;)  cos(2y), 
fxx  =  -9sin(3x)  cos(2|/) 


14.6.6. 


/.  =  e^+y  ,  /,  =  2ye-+y 

f      —  pX+y'^ 
J  XX  —  ^  , 

fyy  =  4y^e-+y"  +2e-+y\ 


14.6.7. 


fxy  —  2?/e 

fx  — 


2{x^  +  y^y  'y 

_  Zx 


2y- 


+  y4  ■ 


fxy 


Qy" 


8y 


6 


14.6.8. 


—Gx'^y^ 

_  -X       _  -y 

16^' 4^' 
16^2  + 


;     +  y^ 
16^3 


^-2/  -  64^3 


14.6.9. 


=  2 


y  +  ^ 


X  +  z 


^     x  +  z 
(x  +  y)-''  \x^yY' 
2z 


^xy  — 


14.7.1. 
14.7.2. 
14.7.3. 
14.7.4. 
14.7.5. 
14.7.6. 
14.7.7. 
14.7.8. 
14.7.9. 
14.7.10. 


(x  +  yY 
minimum  at  (1,-1) 
none 


none 


14.7.11 
14.7.12 
14.7.13 


maximum  at  (1,-1/6) 
none 

minimum  at  (2,-1) 

/(2,2)  =  -2,  /(2,0)  =  4 

a  cube  1  /  \/2  on  a  side 

65/3  X  65/3  X  130/3 

It  has  a  square  base,  and  is  one  and 
one  half  times  as  tall  as  wide.  If 
the  volume  is  V  the  dimensions  are 

^2y/3  X  ^2y/3  X  ^/WJl. 

v/100/3 


x^  ^-y'^ 


2{x^  +  y4)2  ■ 


\axQ  +  hyQ  +  czq  —  d\l \J ^l?  ^  (P- 

The  sides  and  bottom  should  all  be 
2/3  meter,  and  the  sides  should  be 
bent  up  at  angle  7r/3. 

14.7.14.  (3,4/3) 
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14.7.16. 
14.7.17. 
14.7.19. 

14.8.1. 

14.8.2. 

14.8.3. 


14.8.4. 
14.8.5. 
14.8.6. 
14.8.7. 

14.8.8. 

14.8.9. 
14.8.10. 
14.8.11. 
14.8.12. 
14.8.13. 
14.8.14. 
14.8.15. 
14.8.16. 
14.8.17. 

15.1.1. 

15.1.2. 

15.1.3. 

15.1.4. 

15.1.5. 

15.1.6. 

15.1.7. 

15.1.8. 


\b\ifb<  1/2,  otherwise  ^6-  1/4 
|6|  if  6  <  1/2,  otherwise  ^6-  1/4 
1024/^3 
a  cube 

65/3  X  65/3  X  130/3 

It  has  a  square  base,  and  is  one  and 
one  half  times  as  tall  as  wide.  If 
the  volume  is  V  the  dimensions  are 
^2V/3  X  {/2V/3  X  ^9V/4. 

\axo  +  byo  +  czq  —  d\/\/a?'  +  b'^  + 
(0,0,1),  (0,0,-1) 


v^x  v^x  ^F/16 

Farthest:  (-^,  V2,2  +  2^2); 
closest:  (2,0,0),  (0,-2,0) 

X  —  y  =  z  =  16 

(1,2,2) 

(0,  V5,-V5),  (0,-V^,V5) 
standard  $65,  deluxe  $75 
X  =  9,  (f)  =  7r/3 
35,  -35 

maximum  e^,  no  minimum 
5,  -9/2 
3,  3,  3 

a  cube  of  side  length  2/V3 

16 

4 

15/8 
1/2 

5/6 

12-65/(2e). 

1/2 

7r/64 


15.1.9. 
15.1.10. 
15.1.11. 
15.1.12. 
15.1.13. 

15.1.14. 

15.1.15. 
15.1.16. 
15.1.17. 
15.1.18. 
15.1.19. 
15.1.20. 
15.1.21. 
15.1.22. 
15.1.23. 
15.1.24. 
15.1.25. 
15.1.26. 
15.1.27. 

15.1.28. 


15.1.30. 
15.2.1. 
15.2.2. 
15.2.3. 
15.2.4. 
15.2.5. 
15.2.6. 
15.2.7. 
15.2.8. 


(2/9)23/2  _  (2/9) 
(1  -cos(l))/4 

(2V^-  l)/6 

7r-2 

(e^  -  l)/6 

4  TT 

15  ^  4 
1/3 

448 

4/5 

Stt 

2 

5/3 
81/2 

2aV3 
An 
7r/32 
31/8 
128/15 
18007r  m^ 
(e^  +  8r-  +  16) 
15 

e5/2  1 


Ve  +  4  - 


5\/5 


15  ^  15 
16-8^2 
47r 

327r/3  -  AVStt 
(2  -  V2)tt/3 
4/9 

57r/3 

7r/6 

7r/2 

7r/2-  1 


500    Appendix  A  Selected  Answers 


15.2.9.  y3/4  +  7r/6 

15.2.10.  8  +  7r 

15.2.11.  7r/12 

15.2.12.  (1 -cos(9))7r/2 

15.2.13.  -a^/lb 

15.2.14.  127r 

15.2.15.  TT 

15.2.16.  16/3 

15.2.17.  2l7r 
15.2.19.  2tt 

15.3.1.  x  =  y  =  2/Z 

15.3.2.  S  =  4/5,  y  =  8/15 

15.3.3.  S  =  0,  y  =  37r/16 

15.3.4.  X  =  0,  y  =  16/(157r) 

15.3.5.  X  =  3/2,  y  =  9/4 

15.3.6.  X  =  6/5,  y  =  12/5 

15.3.7.  X  =  14/27,  y  =  28/55 

15.3.8.  (3/4,2/5) 

/  81v^ 


15.3.9. 


I  807r 


0 


15.3.10.  X  =  7r/2,  y  =  7r/8 

(>27r  />l+cos9 

15.3.11.  M   =  /  (2  + 

Jo  io 
cos  6)r  dr  d9, 

=    I      I  sme{2  + 


0 


0 


cos^)r  drd9, 

M,.  =    I      I  cos6'(2  + 


0 


0 


COS  9)r  dr  d9. 

f7r/2  ncosO 

15.3.12.  M  =  I        I       {r  +  l)rdrd9, 

r/2 

t/2  /.cos6» 


M 


-7I-/2  io 

/7r/2  nCOSO 
/       sin  6'(r  + 
-7r/2  Jq 


iydrd9, 

/7r/2      /"COS  6 
/       COS  6'(r  + 
-7r/2  Jo 

/7r/2  /-l+cose 
/  rdrd9  + 

-Tv/2  J  COS  9 
n3-K/2  /.l+cose 

/        /  r  dr  d9, 

J-k/2  Jo 


t/2 

/7t/2  i>1+cos9 
/  sin  9drd9  + 

-tt/2  Jcose 
f37r/2  /-l+cose 

2 


n^TT/Z  pi 

Jtt/2  Jo 


r  sm9drd9, 


-tt/2  /-l+cose 


My=l        I  cos  6'  d9+ 


15.4.1. 
15.4.2. 
15.4.3. 
15.4.4. 
15.4.5. 
15.4.6. 

15.4.7. 

15.4.8. 
15.5.1. 
15.5.2. 
15.5.3. 
15.5.4. 
15.5.5. 
15.5.6. 
15.5.7. 
15.5.8. 


/TT/A  ni 
-tt/2  J  cose 
/■37r/2  /-l+cose 

r'^  COS  9  dr  d9. 

Jtt/2  Jo 


Tra 


V3/2 

7rA/2/8 

7r/2  -  1 

d^Va^  +  6^  +  c2 
2a6c 

8y37r/3 

11/24 

623/60 

-3eV4  +  2e-3/4 

1/20 

7r/48 

11/84 

151/60 

TT 
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15.5.10. 

Stt 
16 
32 

16.2.3. 

3sin(4)/2 

15.5.11. 

16.2.4. 

0 

15.5.12. 

64/3 

16.2.5. 

2e^ 

15.5.13. 

X  =  y  =  0,  z  =  16/15 

16.2.6. 

128 

15.5.14. 

^  =  'V  =  0,  ^=1/3 

16.2.7. 

(9e-3)/2 

15.6.1. 

7r/12 

16.2.8. 

ge+l  _  ge  _  gl/e-l  ^  gl/e  ^  g4/4  _ 

15.6.2. 

ttCI  -  \/2/2) 

15.6.3. 

57r/4 

16.2.9. 

1  +  sin(l)  -  cos(l) 

15.6.4. 

0 

16.2.10. 

31n3-21n2 

15.6.5. 

57r/4 

16.2.11. 

3/20  +  101n(2)/7 

15.6.6. 

4/5 

16.2.12. 

21n5  -  21n2  +  15/32 

15.6.7. 

2567r/15 

16.2.13. 

1 

15.6.8. 

47r2 

16.2.14. 

0 

15.6.9. 

37r 

ID 

rrkh'^a'^ /12 

16.2.15. 

21  +  cos(l)  -  cos(8) 

15.6.10. 

16.2.16. 

(ln29-ln2)/2 

15.6.11. 

nkha^ /Q 

16.2.17. 

21n2  +  7r/4-2 

15.6.12. 

16.2.18. 

1243/3 

15.6.13. 

47r/5 

16.2.19. 

ln2  +  11/3 

15.6.14. 

157r 

16.2.20. 

3cos(l)  —  cos(2)  —  cos(4)  —  cos(8) 

15.6.15. 

9A;7r(5y2-2y5)/20 

16.2.21. 

-10/3 

15.7.1. 

47rv^/3 

16.3.1. 

no  / 

15.7.2. 

0 

16.3.2. 

x^/4  - 

15.7.3. 

2/3 

16.3.3. 

no  / 

15.7.4. 
15.7.5. 

e2-l 

2e^ 
36 

16.3.4. 
16.3.5. 

no  / 
y  sinx 

15.7.6. 

32(^2 +  ln(l  +  v^))/3 

16.3.6. 

no  / 

15.7.7. 

3cos(l)  -  3cos(4) 

16.3.7. 

xyz 

15.7.8. 

7r(l  -  cos(l))/24 

16.3.8. 

414 

1    7  1  n 

XO.  1  .xu. 

16.3.9. 

6 

16.2.1. 

13yiT/4 

16.3.10. 

1/e  —  sin  3 

16.2.2. 

0 

16.3.11. 

1/^-  1/V3 
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16.4.1.  1 

16.4.2.  0 

16.4.3.  l/(2e)-l/(2e^)+e/2-e72 

16.4.4.  1/2 

16.4.5.  -1/6 

16.4.6.  (2v^-10y5+8V6)/3-2V2/5+l/5 

16.4.7.  11/2 -ln(2) 

16.4.8.  2-7r/2 

16.4.9.  -17/12 

16.4.10.  0 

16.4.11.  -7r/2 

16.4.12.  127r 

16.4.13.  2cos(l)  -  2sin(l)  -  1 

16.5.1.  -1,  0 

16.5.2.  0,  a  +  b 

16.5.3.  (26-a)/3,  0 

16.5.4.  0,  1 

16.5.5.  -27r,  0 

16.5.6.  0,  27r 

16.6.3.  25v^/4 

16.6.4.  TvV21 

16.6.5.  7r(5y5-l)/6 

16.6.6.  47ry2 

16.6.7.  7raV2 

16.6.8.  27ra(a  -  Va^  -  6^) 

16.6.9.  7r((l  +  4a2)3/2-l)/6 

16.6.10.  27r((l +  a2)3/2  _  i)/3 

16.6.11.  Tra^  -  2a^ 

16.6.12.  TraVl  + 

16.6.13.  AVl  +  a2  +  62 

16.6.14.  ylv^FTT 

16.6.15.  Sa^ 


16.7.1.  (0,0,3/8) 

16.7.2.  (11/20,11/20,3/10) 

16.7.3.  (0,0,1364/425) 

16.7.4.  on  center  axis,  h/3  above  the  base 

16.7.5.  16 

16.7.6.  7 

16.7.7.  -TT 

16.7.8.  -137/120 

16.7.9.  -2/e 

16.7.10.  7r62(-46^  -  36^  +  QaH'^  +  6a^)/6 

16.7.11.  4/3kg/s 

16.7.12.  24eo 

16.8.1.  -37T 

16.8.2.  0 

16.8.3.  -47r 

16.8.4.  Stt 

16.8.5.  A{p{c  -b)  +  q{a-c)  +a-b) 

16.9.1.  both  are  -457r/4 

16.9.2.  a^bc  +  ab^c  +  abc^ 

16.9.3.  -2e  +  7/2 

16.9.4.  3 

16.9.5.  3847r/5 

16.9.6.  7r/3 

16.9.7.  IOtt 

16.9.8.  7r/2 

17.1.2.  y  —  aidant  +  C 


17.1.3.  y 


+  1 


n  +  1 

17.1.4.  y^tlnt-t  +  C 

17.1.5.  y  =  nn,  for  any  integer  n. 

17.1.6.  none 

17.1.7.  y  =  ±A/t2  +  c 
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17.1.8. 

17.1.9. 
17.1.10. 
17.1.11. 

17.1.12. 

17.1.13. 
17.1.14. 
17.1.15. 

17.1.16. 
17.1.17. 


y  =  ±l,y={l  +  Ae^')/{l-Ae 

y  =  {2t/3f/' 
y  =  M  +  Ae-''* 

101n(15/2)  . 

 — —  ^  2.52  minutes 

mo 

M 

y-  1  +  Ae-^''^ 
y  =  2e3V2 
In  2 


,2t 


t 


k 


Ri„o/c-  5  In  300 

600e-^'"2/^     261  mg;  —  

m2 

41  days 

In  2/191    ^    48  ^g. 

5730  In  50  ^^^^^ 
 32339  years 


In  2 


17.1.18. 

y 

17.1.19. 

17.2.1. 

y 

Ae-''' 

17.2.2. 

y 

Ae^' 

17.2.3. 

y 

arctan  t 

17.2.4. 

y 

Ae-'"l^ 

17.2.5. 

y 

17.2.6. 

y 

-26^^-3 

17.2.7. 

y 

gl+COS  t 

17.2.8. 

y 

17.2.9. 

y 

0 

17.2.10. 

y 

0 

17.2.11. 

y 

4t2 

17.2.12. 

y 

_2e(V*)-i 

17.2.13. 

y 

17.2.14. 

y 

0 

17.2.15. 

k 

ln5,  y  =  1( 

210  g 


1  T  O  1  R 

h  — 

—LZ/ 16,  y  —  exp[— 

17.  J. 17. 

y  = 

in6„t  ln(3/2) 

10  e    ^  '  ' 

y  — 

-|Q  -iln(2)/6 

J- L/C. 

17.3.1. 

y  — 

17.3.2. 

y 

Ae^*  -  3 

17.3.3. 

^e-(V2)t^+5 

17.3.4. 

y  = 

Ae-"'  -  2 

17.3.5. 

y  = 

Ae*  -     _  2t  -  2 

17.3.6. 

y  = 

Ae"*/2  +  t  -  2 

17.3.7. 

y  = 

3t 

17.3.8. 

17.3.9. 

17.3.10. 

17.4.1. 
17.4.2. 
17.4.3. 
17.4.4. 

17.5.4. 

17.5.5. 

17.5.6. 

17.5.7. 

17.5.8. 

17.5.9. 
17.5.10. 
17.5.11. 
17.5.12. 
17.5.13. 
17.5.14. 
17.5.15. 


y  = 
y  = 
y  = 


C  2  r- 

A  cos  t  +  sint 
A 


sec  t  +  tan  t 
f»  1.355 


+  1 


t 


sec  t  +  tan  t 


y{l)  40.31 
?/(l)  1.05 
2/(1)  fti  2.30 


1 


2a;  2a; 
2cos(3t)  +  5sin(3t) 

-(l/4)e-^*  +  (5/4)e-* 

-2e-3*  +  2e^' 

5e"^*  +  20te"^* 

(16t  -  3)e^^ 

-2cos(v/5t)  +  V5sm{V5t) 
— -\/2cost  +  -\/2sint 
e-^*  (4  cos  t  + 24  sin  t) 

2e"^*  sin(3t) 

2cos(2t-7r/6) 

5V2cos(10t-7r/4) 
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17.5.16.  ^e--  cos(3t  -  ^4)  17.6.20.             cos(3t)  +  ||  sin(3t))  + 

17.5.17.  5e^*cos(3t  +  arcsin(4/5))  ^ 

17.5.18.  (2cos(5t)  +  sin(5t))e-2*  325      ^  ~  325  ^'"^  * 

17.5.19.  -(l/2)e-2*sin(2i)  17.6.21.  e-2*(^sin(5t)  +  Scos(5t))  + 

17.6.1.  Ae^^  +  Bte-^  +  (6/169)  cost  -  ^'^^(^t)  +  25cos(2t) 
(5/338)  sini  17.6.22.  e-2*(^sin(2t)  +  5cos(2t))  + 

(14/195)  sin  t- (8/195)  cost 

17.6.2.  Ae-^*  +  Ste-^2t     5  \      ,  /  ;  / 

17.7.1.  ^sin(t) +  Scos(t) - 

17.6.3.  Acos(4t)  +  Ssin(4t)  +  (1/2)^^  +  costln  |  sect  +  tant| 
(3/16)t-5/16  1 

r-                r-  17.7.2.  A  sin(t) +Scos(t)  + -e^* 

17.6.4.  ^cos(v^t)  +  Ssin(v^t)-(cos(5t)+  ''5 

sin(5t))/23  17.7.3.  Asin(2t)  +  i?cos(2t)  +  cost  - 

^  -  ^  „         .       ,         •   .\      9/ /r.  sin  t  cos  tin  I  sect  +  tan  t| 

17.6.5.  e*(^cost  + 5sint)  +  6^72  '  ' 

^             ^  17.7.4.  ylsin(2t)  +  Scos(2t)  + 

17.6.6.  Ae^^*  +  Se-^^*  +  2  -  t/3  -  e-75  1 

7^sin(2t)sin2(t)  + 

17.6.7.  +  Be^'  -  (l/5)te-^*  2      ^    ^       ^  ^ 

1  t 

17.6.8.  Ae*  +  Se^*  +  (1  /2)te3*  2  ^'"''^^^^  In  |  cos  t|  -  -  cos(2t)  + 

17.6.9.  Acos(4t)  +  Ssin(4t)  +  (l/8)tsin(4t)  ^  sin(2t)  cos(2t) 

17.6.10.  ^cos(3t)  +  5sin(3t)-(l/2)tcos(3t)  ^3 

17.6.11.  ^e-«*  +  Ste-6*  +  3t2e-6*  +  +  1^^'*  " 

17.6.12.  Ae^*  +  Ste^*  -  t^e^*  (---+   ^  ^ 


5      25     125  /  5 

17.6.13.  Ae"*  +  Se"^*  +  (4/5)  „  „       '     .  „  , 

^  '  '  17.7.6.  ^e*sint  +  Be*  cost  - 

17.6.14.  Ae^*  +  Be-''  +  (1/144)  -  (t/12)  ^^^^^^  |  ^^^^  ^  ^^^^| 

17.6.15.  ^cos(y5t)  +  Ssin(y5t)  +  8sin(2t)  ^  ^   ^e*sint+Se*cost-l  cost(cos^t+ 

17.6.16.  Ae''*  +  Be-^'  +  te^'  „  .  3^      ^      ^  ,  ^ 

dsm  t  ^  2 cost  —  smtj  + 

17.6.17.  4e*  +  e-*-3t-5  1  ^  „ 

—  sin  t(sin  t  —  3  cos  t  —  2  sin  t  + 

17.6.18.  -(4/27)sin(3t)  +  (4/9)t  10  ^  ^ 

17.6.19.  e-672cost  +  20sint)  +  26"^*  ^°^^)  =  10  ^^"^^t)  -  -  sin(2t) 


B 

Useful  Formulas 


Algebra 


Remember  that  the  common  algebraic  operations  have  precedences  relative  to  each  other: 
for  example,  multiplication  and  division  take  precedence  over  addition  and  subtraction,  but 
are  "tied"  with  each  other.  In  the  case  of  ties,  work  left  to  right.  This  means,  for  example, 
that  l/2x  means  (1/2)2;:  do  the  division,  then  the  multiplication  in  left  to  right  order.  It 
sometimes  is  a  good  idea  to  use  more  parentheses  than  strictly  necessary,  for  clarity,  but 
it  is  also  a  bad  idea  to  use  too  many  parentheses. 

Completing  the  square:      +  hx  +  c  =  {x  +       —  ^  +  c. 

^     1     ■    r       11              r     9     7               — 6  ±  \J\P-  —  Aac 
Quadratic  formula:  the  roots  of  ax  +  ox  +  c  are  . 


2a 


Exponent  rules 


a 


b+c 


a 


b 


a 


b-c 


a 


i/b 


^a 


Geometry 


Circle:  circumference  =  27rr,  area  =  7rr^. 
Sphere:  vol  =  47rr^/3,  surface  area  =  Airr^. 
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Cylinder:  vol  =  irr^h,  lateral  area  =  27rrh,  total  surface  area  =  27rrh  +  27rr^. 


Cone:  vol  =  nr'^h/S,  lateral  area  =  7Tr\/r^~+~h^,  total  surface  area  =  Trr^/r"^  +  h?  + 


Analytic  geometry 

Point-slope  formula  for  straight  line  through  the  point  (a;o,yo)  with  slope  m:  y 
yo  +  m{x  -  xq). 

Circle  with  radius  r  centered  at  {h,k):  {x  —  h)'^  +  {y  —  k)"^  =  r'^ . 

xP'  y'^ 

Ellipse  with  axes  on  the  x-axis  and  y-axis:  ^  +  -77  =  1. 


Trigonometry 

sin(6') 

=  opposite/hypotenuse 

cos(^) 

=  adjacent /hypotenuse 

tan(^) 

=  opposite/adjacent 

sec(^) 

=  l/cos(^) 

csc(^) 

=  l/sin(^) 

cot(^) 

=  l/tan(^) 

tan(6') 

=  sin(^)/cos(^) 

cot(^) 

=  cos(^)/sin(^) 

sin(6') 

=  cos(f 

cos(^) 

=  sin(f 

sin(6'  +  tt)  =  -  sin(6') 

cos(6'  +  tt)  =  —  cos(6') 

Law  of  cosines:      =  b'^  +     —  2bccosA 

.  a  b  c 

Law  01  sines:   7  =  — — —  =  — — — 

sm  A      sm  B      sm  C 

Sine  of  sum  of  angles:  sin(a;  +  y)  =  sin  x  cos  y  +  cos  x  sin  y 
Sine  of  double  angle:  sin(2x)  =  2  sin  a:  cos  a: 
Sine  of  difference  of  angles:  sin  (a;  —  y)  —  sin  a;  cosy  —  cos  a;  sin  y 
Cosine  of  sum  of  angles:  cos  (a;  +  y)  =  cos  a;  cosy  —  sin  a;  sin  y 
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Cosine  of  double  angle:  cos(2a;)  =  cos^  x  —  sin^  x  =  2  cos^  x  —  1  =  1  —  2  sin^  x 
Cosine  of  difference  of  angles:  cos  (a;  —  y)  =  cos  x  cosy  +  sin  a;  sin  j/ 

r  o      ^  /        \       tciii  ^  +  tan  y 

Tangent  of  sum  of  angles:  tan  (a;  +  y )  =   

1  —  tan  x  tan  y 

sin^(^)  and  cos^(6')  formulas: 

sin2(^)  +cos2(^)  =  1 

tan2(6')  +  1  =  sec^{e) 


2 

cos2(^)  = 


2/m  _  1  +  cos(26') 
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A 

absolute  extremum,  119 
acceleration 
normal  component,  345 
tangential  component,  345 
acceleration  vector,  345 
algebraic  precedence,  505 
alternating  harmonic  series,  273 
anti-parallel  vectors,  309 
antiderivative,  155 
arc  length,  339 

arc  length  parameterization,  340 

arccosine,  96 

arcsine,  94 

asymptote,  23,  115 

average,  403 

average  height,  383 

B 

bell  curve,  226 
binormal,  344,  436 
bounded  function,  54 

c 

cardioid,  242 

Cartesian  coordinates,  241,  323 
Cauchy  Principal  Value,  220 
center  of  mass,  403 
center  of  mass,  215 
centroid,  216,  400 


chain  rule,  67 

characteristic  polynomial,  468 

chord,  32 

circle 

area,  505 

circumference,  505 

equation  of,  21,  506 

unit,  21 
Clairaut's  Theorem,  371 
closed  paths,  425 
completing  the  square,  505 
composition  of  functions,  27,  45,  67 
concave  down,  114 
concave  up,  114 
cone 

lateral  area,  506 

surface  area,  506 

volume,  506 
conservative  vector  field,  424 
continuous,  55 
convergent  sequence,  259 
convergent  series,  264 
coordinates 

Cartesian,  241,  323 

converting  rectangular  to  polar,  242 

cylindrical,  323 

polar,  241,  323 

rectangular,  241,  323 

spherical,  325 
cosines 

law  of,  307,  506 
critical  value,  108 
cross  product,  313 
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cumulative  distribution  function,  226 

curvature,  342 

curvature  formula,  342,  343 

cycloid,  251 

cylinder 

lateral  area,  506 

surface  area,  506 

volume,  506 
cylindrical  coordinates,  405 
cylindrical  coordinates,  323 

D 

definite  integral,  159 
del  (V),  367 
dependent  variable,  24 
derivative,  34 

dot  notation,  132 

Leibniz  notation,  49 
difference  quotient,  32 
differentiable,  55,  362 
differential,  144 
differential  equation,  453 

first  order,  454 

particular  solution,  456 

separable,  456 
directional  derivative,  367 
discrete  probability,  223 
discriminant,  372 
displacement  vector,  302 
divergence  test,  266 
Divergence  Theorem,  448 
divergent  sequence,  259 
divergent  series,  264 
domain,  23 
dot  notation,  132 
dot  product,  308 
double  integral,  385 

E 

ellipse 

equation  of,  506 
ellipsoid,  239 
error  estimate,  185 
escape  velocity,  222 
Euler's  Method,  464 
exp  function,  85 
expected  value,  224 
exponential  distribution,  227 
exponential  function,  82 
Extreme  Value  Theorem,  121 


F 

Fermat's  Theorem,  108 
flux,  441 
frustum,  235 
Fubini's  Theorem,  387 
function,  22 

bounded,  54 

differentiable,  55 

implicit,  90 

linear,  22 

of  two  variables,  356 

rational,  179 

unbounded,  54 
function  composition,  27 
Fundamental  Theorem  of  Algebra,  111 
Fundamental  Theorem  of  Calculus,  153 

G 

Gabriel's  horn,  223,  238 
geometric  series,  264 
global  extremum,  119 
gradient,  367 
greatest  integer,  110 

H 

harmonic  series,  267 

alternating,  273 
Hooke's  Law,  211 
hyperbolic  cosine,  102 
hyperbolic  sine,  102 
hypercycloid,  252 
hypocycloid,  252 

/ 

implicit  differentiation,  89,  365 
implicit  function,  90 
improper  integral,  219 

convergent,  219 

diverges,  219 
incompressible,  434 
indefinite  integral,  159 
independent  variable,  24 
inflection  point,  114 
initial  condition,  454 
initial  value  problem 

first  order,  454 
integral 

improper,  219 
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of  sec  a;,  175 

of  sec^  X,  175 

properties  of,  163 
integral  sign,  155 
integral  test,  270 
integration 

by  parts,  176 
Intermediate  Value  Theorem,  55 
interval  of  convergence,  283 
inverse  function,  82 
inverse  sine,  94 
involute,  252 

J 

Jacobian,  413 
judicious  guessing,  471 

K 

kinetic  energy,  221 
L 

L'Hopital's  Rule,  98 
Lagrange  multipliers,  377 
lateral  area  of  a  cone,  130 
law  of  cosines,  307,  506 
law  of  sines,  506 
Leibniz  notation,  49 
level  curve,  351 
level  set,  352 
level  surface,  352 
limit,  41 

limit  at  infinity,  98 

limit  of  a  sequence,  259 

line  integral,  419 

linear  approximation,  143 

linearity  of  the  derivative,  60 

local  extremum,  107 

local  maximum,  107 

local  minimum,  107 

logarithm,  82 

logarithmic  function,  82 

logistic  equation,  458 

long  division  of  polynomials,  181 

M 

Maclaurin  series,  287 
mass,  403 
mean,  224,  227 


Mean  Value  Theorem,  146 
moment,  215,  398,  404 

N 

Newton,  210 

Newton's  law  of  cooling,  453 
Newton's  method,  139-141 
normal,  317,  344 
normal  distribution,  231 

o 

one  sided  limit,  46 
optimization,  119 
orientable  surface,  442 
oriented  curve,  427 

P 

p-series,  270 
parallel  vectors,  309 
parallelogram 

and  vector  sum,  303 

area  of,  315,  400,  412 
parametric  equations,  250,  320,  330 
partial  fractions,  179 
particular  solution,  456 
physicists,  132 
point-slope  formula,  506 
polar  coordinates,  241,  323 
polynomial 

of  two  variables,  356 
power  function,  57 
power  rule,  57,  89,  92 
precedence 

of  algebraic  operations,  505 
probability  density  function,  225 
product  rule,  62,  63 

generalized,  64 
projection,  421 

scalar,  421 
projection  of  a  vector,  309 
properties  of  integrals,  163 

Q 

quadratic  formula,  505 
quotient  rule,  64 

R 

radian  measure,  73 
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radius  of  convergence,  283 
random  variable,  224 
rational  function,  66,  179 
rectangular  coordinates,  241,  323 
related  rates,  131 
resonant  frequency,  475 
right  hand  rule,  316 
Rolle's  Theorem,  146 

s 

scalar  multiplication,  304 
scalar  projection,  421 
separation  of  variables,  456 
sequence,  258 

bounded,  262 

bounded  above,  262 

bounded  below,  262 

convergent,  259 

decreasing,  262 

divergent,  259 

increasing,  262 

monotonic,  262 

non-decreasing,  262 

non-increasing,  262 

of  partial  sums,  264 
series,  258 

p-series,  270 

absolute  convergence,  278 

alternating  harmonic,  273 

conditional  convergence,  278 

convergent,  264 

divergent,  264 

geometric,  264 

harmonic,  267 

integral  test,  270 

interval  of  convergence,  283 

Maclaurin,  287 

radius  of  convergence,  283 

Taylor,  290 
Simpson's  Rule,  187 
sines 

law  of,  506 
slope  field,  466 
sphere 

surface  area,  505 

volume,  505 
spherical  coordinates,  325,  405 
spiral  of  Archimedes,  243 
squeeze  theorem,  77 
standard  deviation,  229 
standard  normal  distribution,  227 


standard  normal  probability  density 

function,  226 
steady  state  part  of  solution  to  d.e.,  474 
substitution,  409 
subtend,  74 
sum 

of  vectors,  303 
sum  rule,  61 
surface  area,  400 

T 

tangent  line,  32 

Taylor  series,  290 

Toricehi's  trumpet,  223,  238 

torque,  213 

torus,  238 

transcendental  function,  73 
transient  part  of  solution  to  d.e.,  474 
Trapezoid  Rule,  184 
triangle  inequality,  43,  302 
trigonometric  identities,  506 

u 

unbounded  function,  54 

undetermined  coefficients,  471 

uniform  distribution,  226 

uniform  probability  density  function,  226 

unit  normal,  344 

unit  binormal,  344 

unit  circle,  21 

unit  vector,  310,  336 

V 

variance,  229 

variation  of  parameters,  461,  475 
vector,  302 

anti-parallel,  309 

cross  product,  313 

displacement,  302 

function,  330 

normal  to  a  plane,  317 

parallel,  309 

projection,  309 

scalar  multiplication,  304 

sum,  303 

unit,  310 
vector  fields,  417 
velocity,  37 
velocity  vector,  345 
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w 


witch  of  Agnesi,  66 
work,  209 


